IMPROVED ADENOVIRUS AND METHODS OF USE THEREOF

This invention was supported by the National
Institutes of Health Grant No. P30 DK 47757. The United
States government has rights in this invention.

Field of the Invention

The present invention relates to the field of
vectors useful in somatic gene therapy and the production
thereof.

Background of the Invention

Human gene therapy is an approach to treating human
disease that is based on the modification of gene
expression in cells of the patient. It has become
apparent over the last decade that the single most
outstanding barrier to the success of gene therapy as a
strategy for treating inherited diseases, cancer, and
other genetic dysfunctions is the development of useful
gene transfer vehicles. Eukaryotic viruses have been
employed as vehicles for somatic gene therapy. Among the
viral vectors that have been cited frequently in gene
therapy research are adenoviruses.

Adenoviruses are eukaryotic DNA viruses that can be
modified to efficiently deliver a therapeutic or reporter
transgene to a variety of cell types. Recombinant
adenoviruses types 2 and 5 (Ad2 and Ad5, respectively),
which cause respiratory disease in humans, are currently
being developed for gene therapy. Both Ad2 and Ad5
belong to a subclass of adenovirus that is not
associated with human malignancies. Recombinant
adenoviruses are capable of providing extremely high
levels of transgene delivery to virtually all cell types,
regardless of the mitotic state. High titers (10^13
plaque forming units/ml) of recombinant virus can be

easily generated in 293 cells (the adenovirus equivalent
to retrovirus packaging cell lines) and stored for
extended periods without appreciable losses. The
efficacy of this system in delivering a therapeutic
transgene in vivo that complements a genetic imbalance
has been demonstrated in animal models of various
disorders [Y. Watanabe, ...; Tanzawa et al, ...;
...:288-296 (1983); ...]. Indeed, a recombinant
replication-defective adenovirus encoding a cDNA for the
cystic fibrosis transmembrane regulator (CFTR) has been
approved for use in human clinical trials [see, e.g.,
J. Wilson, ... 692 (Oct. 21, 1993)]. Further support of
the safety of recombinant adenoviruses for gene therapy
is the extensive experience of live adenovirus vaccines
in human populations.

Human adenoviruses are comprised of a linear,
approximately 36 kb double-stranded DNA genome, which is
divided into 100 map units (m.u.), each of which is 360 bp
in length. The gene products are organized into early (E1
through E4) and late (L1 through L5) regions, based on
expression before or after the initiation of viral DNA
synthesis [see, e.g., Horwitz, ..., Ltd., ...]. The
first-generation recombinant adenoviruses developed for
gene therapy contain deletions in the E1 region. Such a
virus is grown on a human



WO 96/13597 PCT/US95/14017

embryonic kidney cell line containing a functional
adenovirus E1a gene which provides a transacting E1a
protein, the 293 cell [ATCC CRL1573]. E1-deleted viruses
are capable of replicating and producing infectious virus
in the 293 cells, which provide E1a and E1b region gene
products in trans. The resulting virus is capable of
infecting many cell types and can express the introduced
gene (providing it carries its own promoter), but cannot
replicate in a cell that does not carry the E1 region DNA
unless the cell is infected at a very high multiplicity
of infection.
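
The map-unit convention used above for the adenovirus genome can be
illustrated with a small worked calculation. The following Python sketch
is illustrative only: it assumes a round genome length of 36,000 bp (the
text says only "approximately 36 kb"), and the function names are
hypothetical rather than taken from any established tool.

```python
# Assumption: a round 36,000 bp genome; the text states "approximately 36 kb".
GENOME_LENGTH_BP = 36_000
MAP_UNITS = 100  # the genome is divided into 100 map units (m.u.)


def bp_per_map_unit(genome_length_bp=GENOME_LENGTH_BP, map_units=MAP_UNITS):
    """Return the number of base pairs represented by one map unit."""
    return genome_length_bp / map_units


def bp_to_map_units(position_bp, genome_length_bp=GENOME_LENGTH_BP):
    """Convert a base-pair position into map units."""
    return position_bp * MAP_UNITS / genome_length_bp


if __name__ == "__main__":
    print(bp_per_map_unit())        # size of one map unit in bp
    print(bp_to_map_units(360))     # position of bp 360 in m.u.
    print(bp_to_map_units(18_000))  # midpoint of the genome in m.u.
```

Under these assumptions one map unit corresponds to 360 bp, so bp 360
lies at 1 m.u. and the midpoint of the genome at 50 m.u.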

However, in vivo studies revealed transgene
expression in these E1-deleted vectors was transient and
invariably associated with the development of severe
inflammation at the site of vector targeting [S.
Ishibashi et al, J. Clin. Invest., 93:1885-1893 (1994);
J. M. Wilson et al, Proc. Natl. Acad. Sci., USA, 85:4421-
4424 (1988); J. M. Wilson et al, Clin. Bio., 2:21-26
(1991); M. Grossman et al, Som. Cell. and Mol. Gen.,
17:601-607 (1991)]. One explanation proposed for this
finding is that first generation recombinant
adenoviruses, despite the deletion of E1 genes, express
low levels of other viral proteins. This could be due to
basal expression from the unstimulated viral promoters or
transactivation by cellular factors. Expression of viral
proteins leads to cellular immune responses to the
genetically modified cells, resulting in their
destruction and replacement with non-transgene-containing
cells.

There yet remains a need in the art for the
development of additional adenovirus vector constructs
for gene therapy.

Summary of the Invention

In one aspect, the invention provides the components
of a recombinant adenovirus production system. One
component is a shuttle plasmid, pAdA, that comprises
adenovirus cis-elements necessary for replication and
virion encapsidation and is deleted of all viral genes.
This vector carries a selected transgene under the
control of a selected promoter and other conventional
vector/plasmid regulatory components. The other
component is a helper adenovirus, which alone or with a
packaging cell line, supplies sufficient gene sequences
necessary for a productive viral infection. In a
preferred embodiment, the helper virus has been altered
to contain modifications to the native gene sequences
which direct efficient packaging, so as to substantially
disable or "cripple" the packaging function of the helper
virus or its ability to replicate.

In another aspect, the present invention provides a
recombinant adenovirus, the AdA virus, which is produced
by use of the components above. This recombinant virus
comprises an adenovirus capsid and adenovirus cis-elements
necessary for replication and virion encapsidation, but
is deleted of all viral genes (i.e., all viral open
reading frames). This virus carries a selected
transgene under the control of a selected promoter and
other conventional vector regulatory components. This
AdA recombinant virus is characterized by high titer
transgene delivery to a host cell and the ability to ...
chromosome. In one embodiment, the virus carries as its
transgene a reporter gene. Another embodiment of the
recombinant virus contains a therapeutic transgene.

In another aspect, the invention provides a method
for producing the above-described recombinant AdA virus
by co-transfecting a cell line (either a packaging cell
line or a non-packaging cell line) with a shuttle vector
or plasmid and a helper adenovirus as described above,
wherein the transfected cell generates the AdA virus.
The AdA virus is subsequently isolated and purified
therefrom.

In yet a further aspect, the invention provides a
method for delivering a selected gene to a host cell for
expression in that cell by administering an effective
amount of a recombinant AdA virus containing a
therapeutic transgene to a patient to treat or correct a
genetically associated disorder or disease.

Other aspects and advantages of the present 
invention are described further in the following detailed 
description of the preferred embodiments thereof. 


Brief Description of the Figures 

Fig. 1A is a schematic representation of the
organization of the major functional elements that define
the 5' terminus from Ad5 including an inverted terminal
repeat (ITR) and a packaging/enhancer domain. The TATA
box of the E1 promoter (black box) and E1A
transcriptional start site (arrow) are also shown.
Fig. 1B is an expanded schematic of the
packaging/enhancer region of Fig. 1A, indicating the five
packaging (PAC) domains (A-repeats), I through V. The
arrows indicate the location of PCR primers referenced in
Figs. 9A and 9B below.

Fig. 2A is a schematic of shuttle vector
pAdA.CMVLacZ containing 5' ITR from Ad5, followed by a
CMV promoter/enhancer, a LacZ gene, a 3' ITR from Ad5,
and remaining plasmid sequence from plasmid pSP72
backbone. Restriction endonuclease enzymes are
represented by conventional designations in the plasmid
constructs.
Fig. 2B is a schematic of the shuttle vector
digested with EcoRI to release the modified AdA genome
from the pSP72 plasmid backbone.
Fig. 2C is a schematic depiction of the function of
the vector system. In the presence of an E1-deleted
helper virus Ad.CBhpAP which encodes a reporter minigene
for human placenta alkaline phosphatase (hpAP), the
pAdA.CMVLacZ genome is packaged into capsids,
distinguishable from the helper virions by the presence
of the LacZ gene.
Fig. 3 [SEQ ID NO: 1] reports the top DNA
strand of the double-stranded plasmid pAdA.CMVLacZ. The
complementary sequence may be readily obtained by one of
skill in the art. The sequence includes the following
components: 3' Ad ITR (nucleotides 607-28 of SEQ ID NO:
1); CMV promoter/enhancer (nucleotides 5117-4524 of SEQ
ID NO: 1); SD/SA sequence (nucleotides 4507-4376 of SEQ
ID NO: 1); LacZ gene (nucleotides 4320-... of SEQ ID NO:
1); and a poly A sequence (nucleotides 837-639 of SEQ ID
NO: 1).
Fig. 4A is a schematic of shuttle vector
pAdAc.CMVLacZ containing 5' ITR and 3' ITR sequences
positioned head-to-tail, with a CMV enhancer/promoter-
LacZ minigene ..., followed
by a plasmid pSP72 (Promega) backbone. Restriction
endonuclease enzymes are represented by conventional
designations in the plasmid constructs.
Fig. 4B is a schematic depiction of the function of
the vector system of Fig. 4A. In the presence of helper
virus, the vector sequence is packaged into virion heads,
distinguishable from the helper virions by the presence
of the LacZ gene.
Figs. 5A to 5F [SEQ ID NO: 2] report the top DNA
strand of the double-stranded vector pAdAc.CMVLacZ. The
complementary sequence may be readily obtained by one of
skill in the art. The sequence includes the following
components: 5' Ad ITR (nucleotides 600-958 of SEQ ID NO:
2); CMV promoter/enhancer (nucleotides 969-1563 of SEQ ID
NO: 2); SD/SA sequence (nucleotides 1579-1711); LacZ gene
(nucleotides 1762-5236 of SEQ ID NO: 2); poly A sequence
(nucleotides 5245-5443 of SEQ ID NO: 2); and 3' Ad ITR
(nucleotides 16-596 of SEQ ID NO: 2).
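
The component coordinates listed for SEQ ID NO: 2 can be checked with a
few lines of bookkeeping. The following Python sketch is a minimal
illustration, assuming inclusive, 1-based nucleotide ranges; the
dictionary and function names are hypothetical. It computes the length
of each listed component and checks that neighbouring components do not
overlap.

```python
# Component coordinates for SEQ ID NO: 2, copied from the figure description.
# Assumption: ranges are inclusive and 1-based.
FEATURES_SEQ_ID_NO_2 = {
    "3' Ad ITR": (16, 596),
    "5' Ad ITR": (600, 958),
    "CMV promoter/enhancer": (969, 1563),
    "SD/SA sequence": (1579, 1711),
    "LacZ gene": (1762, 5236),
    "poly A sequence": (5245, 5443),
}


def feature_length(start, end):
    """Length of an inclusive range; works for descending ranges as well."""
    return abs(end - start) + 1


def overlapping_pairs(features):
    """Return pairs of neighbouring features (in coordinate order) that overlap."""
    spans = sorted((min(s, e), max(s, e), name) for name, (s, e) in features.items())
    overlaps = []
    for (_, prev_end, prev_name), (next_start, _, next_name) in zip(spans, spans[1:]):
        if next_start <= prev_end:
            overlaps.append((prev_name, next_name))
    return overlaps


if __name__ == "__main__":
    for name, (start, end) in FEATURES_SEQ_ID_NO_2.items():
        print(f"{name}: {feature_length(start, end)} nt")
    print("overlapping neighbours:", overlapping_pairs(FEATURES_SEQ_ID_NO_2))
```

The same length function applies to ranges written in descending order,
such as those given on the complementary strand for other sequences.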

Fig. 6 is a schematic of shuttle vector pAdA.CBCFTR
containing 5' ITR from Ad5, followed by a chimeric CMV
enhancer/β-actin promoter enhancer, a CFTR gene, a poly-A
sequence, a 3' ITR from Ad5, and remaining plasmid
sequence from plasmid pSL1180 (Pharmacia) backbone.
Restriction endonuclease enzymes are represented by
conventional designations in the plasmid constructs.

Figs. 7A to 7H [SEQ ID NO: 3] report the top DNA
strand of the double-stranded plasmid pAdA.CBCFTR. The
complementary sequence may be readily obtained by one of
skill in the art. The sequence includes the following
components: 5' Ad ITR (nucleotides 9611-9254 of SEQ ID
NO: 3); chimeric CMV enhancer/β-actin promoter
(nucleotides 9241-8684 of SEQ ID NO: 3); CFTR gene
(nucleotides 8622-4065 of SEQ ID NO: 3); poly A sequence
(nucleotides 3887-3684 of SEQ ID NO: 3); and 3' Ad ITR
(nucleotides 3652-3073 of SEQ ID NO: 3). The remaining
plasmid backbone is obtained from pSL1180 (Pharmacia).

Fig. 8A illustrates the generation of 5' adenovirus
terminal sequence that contained PAC domains I and II by
PCR. See, arrows indicating righthand and lefthand (PAC
II) PCR probes in Fig. 1B.

Fig. 8B illustrates the generation of 5' adenovirus
terminal sequence that contained PAC domains I, II, III
and IV by PCR. See, arrows indicating righthand and
lefthand (PAC IV) PCR probes in Fig. 1B.
Fig. 8C depicts the amplification products subcloned
into the multiple cloning site of the pAd.Link.1 vector,
generating ... with modified packaging (PAC) signals.

Fig. 9A is a schematic representation of the
subcloning of a human placenta alkaline phosphatase
reporter minigene containing the immediate early CMV
enhancer/promoter (CMV), human placenta alkaline
phosphatase cDNA (hpAP), and SV40 polyadenylation signal
into ... to result in a crippled helper virus vector.
Restriction endonuclease enzymes are represented by
conventional designations in the plasmid constructs.
Fig. 9B is a schematic representation of the
subcloning of the same minigene of Fig. 9A into ...
to result in crippled helper virus vector
pAd.PACIV.CMV...
Fig. 10 is a flow diagram summarizing the synthesis
of an adenovirus-based polycation helper virus conjugate
and its combination with a pAdA shuttle vector. Helper
adenovirus is reacted with the crosslinker sulfo-SMCC,
and the capsid protein fiber is labeled with the
nucleophilic maleimide moiety. Free sulfhydryls were
introduced onto poly-L-lysine using 2-iminothiolane-HCl
and mixed with the labelled adenovirus, resulting in the
helper virus conjugate Ad-pLys. A unique
adenovirus-based particle is generated by purifying the
Ad-pLys conjugate over a CsCl gradient to remove
unincorporated poly-L-lysine, followed by extensively
dialyzing, adding shuttle plasmid DNAs to Ad-pLys and
allowing the complex formed by the shuttle plasmid
wrapped around Ad-pLys to develop.
Fig. 11 is a schematic diagram of pCCL-DMD, which is
described in detail in Example 9 below.
Figs. 12A - 12P provide the continuous DNA sequence
of pAdA.CMVmDys [SEQ ID NO: 10].

Detailed Description of the Invention

The present invention provides a unique recombinant
adenovirus capable of delivering transgenes to target
cells, as well as the components for production of the
unique virus and methods for the use of the virus to
treat a variety of genetic disorders.

The AdA virus of this invention is a viral particle
containing only the adenovirus cis-elements necessary for
replication and virion encapsidation (i.e., ITRs and
packaging sequences), but otherwise deleted of all
adenovirus genes (i.e., all viral open reading frames).
This virus carries a selected transgene under the control
of a selected promoter and other conventional regulatory
components, such as a poly A signal. The AdA virus is
characterized by improved persistence of the vector DNA
in the host cells, reduced antigenicity/immunogenicity,
and hence, improved performance as a delivery vehicle.
An additional advantage of this invention is that the AdA
virus permits the packaging of very large transgenes,
such as a full-length dystrophin cDNA for the treatment
of the progressive wasting of muscle tissue
characteristic of Duchenne Muscular Dystrophy (DMD).

This novel recombinant virus is produced by use of
an adenovirus-based vector production system containing
two components: 1) a shuttle vector that comprises
adenovirus cis-elements necessary for replication and
virion encapsidation and is deleted of all viral genes,
which vector carries a reporter or therapeutic minigene,
and 2) a helper adenovirus which, alone or with a
packaging cell line, is capable of providing all of the
viral gene products necessary for a productive viral
infection. Preferably, the helper virus is modified so
that it does not package itself efficiently. Such a
helper virus is desirably used in combination with a
packaging cell line. The methods of producing this viral
vector from these components include both a novel means
of packaging of an adenoviral/transgene containing vector
into a virus, and a novel method for the separation of
the helper virus from the newly formed recombinant virus.

I. The Shuttle Vector

The shuttle vector, referred to as pAdA, is composed
of adenovirus sequences, and transgene sequences,
including vector regulatory control sequences.

A. The Adenovirus Sequences

The adenovirus nucleic acid sequences of the
shuttle vector provide the minimum adenovirus sequences
which enable a viral particle to be produced with the
assistance of a helper virus. These sequences assist in
delivery of a recombinant transgene genome to a target
cell by the resulting recombinant virus.

The DNA sequences of a number of adenovirus
types are available from Genbank, including type Ad5
[Genbank Accession No. M73260]. The adenovirus sequences
may be obtained from any known adenovirus serotype, such
as serotypes 2, 3, 4, 7, 12 and 40, and further including
any of the presently identified 41 human types [see,
e.g., Horwitz, cited above]. Similarly adenoviruses
known to infect other animals may also be employed in the
vector constructs of this invention. The selection of
the adenovirus type is not anticipated to limit the
following invention. A variety of adenovirus strains are
available from the American Type Culture Collection,
Rockville, Maryland, or available by request from a
variety of commercial and institutional sources. In the
following exemplary embodiment an adenovirus, type 5
(Ad5) is used for convenience.

However, it is desirable to obtain a variety of
pAdA shuttle vectors based on different human adenovirus
serotypes. It is anticipated that a library of such
plasmids and the resulting AdA viral vectors would be
useful in a therapeutic regimen to evade cellular, and
possibly humoral, immunity, and lengthen the duration of
transgene expression, as well as improve the success of
repeat therapeutic treatments. Additionally the use of
various serotypes is believed to produce recombinant
viruses with different tissue targeting specificities.
The absence of adenoviral genes in the AdA viral vector
is anticipated to reduce or eliminate adverse CTL
response which normally causes destruction of recombinant
adenoviruses deleted of only the E1 gene.

Specifically, the adenovirus nucleic acid
sequences employed in the pAdA shuttle vector of this
invention are adenovirus genomic sequences from which all
viral genes are deleted. More specifically, the
adenovirus sequences employed are the cis-acting 5' and
3' inverted terminal repeat (ITR) sequences of an
adenovirus (which function as origins of replication) and
the native 5' packaging/enhancer domain, that contains
sequences necessary for packaging linear Ad genomes and
enhancer elements for the E1 promoter. These sequences
are the sequences necessary for replication and virion
encapsidation. See, e.g., P. Hearing et al, J. Virol.,
61(8):2555-2558 (1987); M. Grable and P. Hearing, J.
Virol., 64(5):2047-2056 (1990); and M. Grable and P.
Hearing, J. Virol., 66(2):723-731 (1992).

According to this invention, the entire
adenovirus 5' sequence containing the 5' ITR and
packaging/enhancer region can be employed as the 5'
adenovirus sequence in the pAdA shuttle vector. This
left terminal (5') sequence of the Ad5 genome useful in
this invention spans bp 1 to about 360 of the
conventional adenovirus genome, also referred to as map
units 0-1 of the viral genome. This sequence is provided
herein as nucleotides 5496-... of SEQ ID NO: 1;
nucleotides 600-958 of SEQ ID NO: 2; and nucleotides
9611-9254 of SEQ ID NO: 3, and generally is from about
353 to about 360 nucleotides in length. This sequence
includes the 5' ITR (bp 1-103 of the adenovirus genome),
and the packaging/enhancer domain (bp 194-358 of the
adenovirus genome). See Fig. 1A.
Preferably, this native adenovirus 5' region is
employed in the shuttle vector in unmodified form.
However, some modifications including deletions,
substitutions and additions to this sequence which do not
adversely affect its biological function may be
acceptable. See, e.g., ... (1993). The ability to modify
these ITR sequences is within the ability of one of skill
in the art. See, e.g., texts such as Sambrook et al,
"Molecular Cloning. A Laboratory Manual.", 2d edit., Cold
Spring Harbor Laboratory, Cold Spring Harbor, New York
(1989).

The 3' adenovirus sequences of the shuttle
vector include the right terminal (3') ITR sequence of
the adenoviral genome spanning about bp ... of
the adenovirus genome, or map units 98.4-100. This
sequence is provided herein as nucleotides 607-28 of SEQ
ID NO: 1; nucleotides 16-596 of SEQ ID NO: 2; and
nucleotides 3652-3073 of SEQ ID NO: 3, and generally is
about 580 nucleotides in length. This entire sequence is
desirably employed as the 3' sequence of a pAdA shuttle
vector. Preferably, the native adenovirus 3' region is
employed in the shuttle vector in unmodified form.
However, some modifications to this sequence which do not
adversely affect its biological function may be
acceptable.

An exemplary pAdA shuttle vector of this
invention, described below and in Fig. 2A, contains only
those adenovirus sequences required for packaging
adenoviral genomic DNA into a preformed capsid head. The
pAdA vector contains Ad5 sequences encoding the 5'
terminal and 3' terminal sequences (identified in the
description of Fig. 3), as well as the transgene
sequences described below.

From the foregoing information, it is expected
that one of skill in the art may employ other equivalent
adenovirus sequences for use in the AdA vectors of this
invention. These sequences may include other adenovirus
strains, or the above mentioned cis-acting sequences with
minor modifications.

B. The Transgene

The transgene sequence of the vector and
recombinant virus is a nucleic acid sequence or reverse
transcript thereof, heterologous to the adenovirus
sequence, which encodes a polypeptide or protein of
interest. The transgene is operatively linked to
regulatory components in a manner which permits transgene
transcription.

The composition of the transgene sequence will
depend upon the use to which the resulting virus will be
put. For example, one type of transgene sequence
includes a reporter sequence, which upon expression
produces a detectable signal. Such reporter sequences
include without limitation an E. coli beta-galactosidase
(LacZ) cDNA, a human placental alkaline phosphatase gene
and a green fluorescent protein gene. These sequences,
when associated with regulatory elements which drive
their expression, provide signals detectable by
conventional means, e.g., ultraviolet wavelength
absorbance, visible color change, etc.

Another type of transgene sequence includes a
therapeutic gene which expresses a desired gene product
in a host cell. These therapeutic nucleic acid sequences
typically encode products for administration and
expression in a patient in vivo or ex vivo to replace or
correct an inherited or non-inherited genetic defect or
treat an epigenetic disorder or disease. Such
therapeutic genes which are desirable for the performance
of gene therapy include, without limitation, a normal
cystic fibrosis transmembrane regulator (CFTR) gene (see
Fig. 7), a low density lipoprotein (LDL) gene [... et al,
...], a DMD cDNA sequence [partial sequences available
from GenBank, Accession Nos. M36673, M36671, [A. P.
Monaco et al, Nature, 323:646-650 (1986)] and L06900,
[Roberts et al, ...]], and a number of genes which may be
readily selected by one of skill in the art. The
selection of the transgene is not considered to be a
limitation of this invention, as such selection is within
the knowledge of the art-skilled.







In addition to the major elements identified
above for the pAdA shuttle vector, i.e., the adenovirus
sequences and the transgene, the vector also includes
conventional regulatory elements necessary to drive
expression of the transgene in a cell transfected with
the pAdA vector. Thus the vector contains a selected
promoter which is linked to the transgene and located,
with the transgene, between the adenovirus sequences of
the vector.

Selection of the promoter is a routine matter
and is not a limitation of the pAdA vector itself.
Useful promoters may be constitutive promoters or
regulated (inducible) promoters, which will enable
control of the amount of the transgene to be expressed.
For example, a desirable promoter is that of the
cytomegalovirus immediate early promoter/enhancer [see,
e.g., Boshart et al, Cell, 41:521-530 (1985)]. This
promoter is found at nucleotides 5117-4524 of SEQ ID NO:
1 and nucleotides 969-1563 of SEQ ID NO: 2. Another
promoter is the CMV enhancer/chicken β-actin promoter
(nucleotides 9241-8684 of SEQ ID NO: 3). Another
desirable promoter includes, without limitation, the Rous
sarcoma virus LTR promoter/enhancer. Still other
promoter/enhancer sequences may be selected by one of
skill in the art.

The shuttle vectors will also desirably contain
nucleic acid sequences heterologous to the adenovirus
sequences including sequences providing signals required
for efficient polyadenylation of the transcript and
introns with functional splice donor and acceptor sites
(SD/SA). A common poly-A sequence which is employed in
the exemplary vectors of this invention is that derived
from the papovavirus SV-40 [see, e.g., nucleotides 837-
639 of SEQ ID NO: 1; 5245-5443 of SEQ ID NO: 2; and 3887-
3684 of SEQ ID NO: 3]. The poly-A sequence generally is
inserted in the vector following the transgene sequences
and before the 3' adenovirus sequences. A common intron
sequence is also derived from SV-40, and is referred to
as the SV-40 T intron sequence [see, e.g., nucleotides
4507-4376 of SEQ ID NO: 1 and 1579-1711 of SEQ ID NO: 2].
A pAdA shuttle vector of the present invention may also
contain such an intron, desirably located between the
promoter/enhancer sequence and the transgene. Selection
of these and other common vector elements is
conventional and many such sequences are available
[see, e.g., Sambrook et al, and references cited
therein]. Examples of such regulatory sequences for the
above are provided in the plasmid sequences described
herein.

The combination of the transgene, promoter/
enhancer, and the other regulatory vector elements is
referred to as a "minigene" for ease of reference herein.
The minigene is preferably flanked by the 5' and 3' cis-
acting adenovirus sequences described above. Such a
minigene may have a size in the range of several hundred
base pairs up to ..., due to the absence of
adenovirus early and late gene sequences in the vector.
Thus, this AdA vector system permits a great deal of
latitude in the selection of the various components of
the minigene, particularly the selected transgene, with
regard to size. Provided with the teachings of this
invention, the design of such a minigene can be made by
resort to conventional techniques.

II. The Helper Virus

Because of the absence of viral genes in
the AdA shuttle vector, a helper adenovirus of
this invention must, alone or in concert with a packaging
cell line, provide sufficient adenovirus gene sequences
necessary for a productive viral infection. Helper
viruses useful in this invention thus contain selected
adenovirus gene sequences, and optionally a second
reporter minigene.

Normally, the production of a recombinant adenovirus
which utilizes helper adenovirus containing a full
complement of adenoviral genes results in recombinant
virus contaminated by excess production of the helper

virus. Thus, extensive purification of the viral vector
from the contaminating helper virus is required.
However, the present invention provides a way to
facilitate purification and reduce contamination by
crippling the helper virus.
One preferred embodiment of a helper virus of this
invention thus contains three components: (A)
modifications or deletions of the native adenoviral gene
sequences which direct efficient packaging, so as to
substantially disable or "cripple" the packaging function
of the helper virus or its ability to replicate, (B)
selected adenovirus genes and (C) an optional reporter
minigene. These "crippled" helper viruses may also be
formed into poly-cation conjugates as described below.
The adenovirus sequences forming the helper virus
may be obtained from the sources identified above in the
discussion of the shuttle vector. Use of different Ad
serotypes as helper viruses enables production of
recombinant viruses containing the AAd (serotype 5)
shuttle vector sequences in a capsid formed by the other
serotype adenovirus. These recombinant viruses are
desirable in targeting different tissues, or evading an
immune response to the AAd sequences having a serotype 5
capsid. Use of these different Ad serotype helper
viruses may also demonstrate advantages in recombinant
virus production, stability and better packaging.

A. The Crippling Modifications

A desirable helper virus used in the production
of the adenovirus vector of this invention is modified
(or crippled) in its 5' ITR packaging/enhancer domain,
identified above. As stated above, the
packaging/enhancer region contains sequences necessary
for packaging linear adenovirus genomes ("PAC"
sequences). More specifically, this sequence contains at
least seven distinct yet functionally redundant domains
that are required for efficient encapsidation of
replicated viral DNA.

Within a stretch of nucleotide sequence from bp
194-358 of the Ad5 genome, five of these so-called A-
repeats or PAC sequences are localized (see, Fig. 1B).
PAC I is located at bp 241-248 of the adenovirus genome
(on the strand complementary to nucleotides 5259-5246 of
SEQ ID NO: 1). PAC II is located at bp 262-269 of the
adenovirus genome (on the strand complementary to
nucleotides 5238-5225 of SEQ ID NO: 1). PAC III is
located at bp 304-311 of the adenovirus genome (on the
strand complementary to nucleotides 5196-5183 of SEQ ID
NO: 1). PAC IV is located at bp 314-321 of the
adenovirus (on the strand complementary to nucleotides
5186-5172 of SEQ ID NO: 1). PAC V is located at bp 339-
346 of the adenovirus (on the strand complementary to
nucleotides 5171-5147 of SEQ ID NO: 1).
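
Because several of the PAC locations above are given on the strand
complementary to the reported sequence, reading them requires taking a
reverse complement. The following Python sketch is a minimal
illustration of that operation, assuming an unambiguous A/C/G/T
alphabet; the function name is hypothetical, and the example input is
the eight-nucleotide core of PAC I.

```python
# Translation table for Watson-Crick base pairing (A<->T, C<->G).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(sequence):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return sequence.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    pac_i_core = "TAAATTTG"  # eight-nucleotide core of PAC I
    print(reverse_complement(pac_i_core))
```

Applying the function twice returns the original sequence, which is a
convenient sanity check when moving between the two strands.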

Corresponding sequences can be obtained from
SEQ ID NO: 2 and 3. PAC I is located at nucleotides 837-
851 of SEQ ID NO: 2; and on the strand complementary to
nucleotides 9374-9360 of SEQ ID NO: 3. PAC II is located
at nucleotides 859-863 of SEQ ID NO: 2; and on the strand
complementary to nucleotides 9353-9340 of SEQ ID NO: 3.
PAC III is located at nucleotides ... of SEQ ID NO:
2; and on the strand complementary to nucleotides 9311-
9298 of SEQ ID NO: 3. PAC IV is located at nucleotides
911-924 of SEQ ID NO: 2; and on the strand complementary
to nucleotides 9301-9288 of SEQ ID NO: 3. PAC V is
located at nucleotides 936-949 of SEQ ID NO: 2; and on
the strand complementary to nucleotides 9276-9263 of SEQ
ID NO: 3.

Table 1 below lists these five native Ad5
sequences and a consensus PAC sequence based on the
similarities between an eight nucleic acid stretch within
the five sequences. The consensus sequence contains two
positions at which the nucleic acid may be A or T (A/T).
The conventional single letter designations are used for
the nucleic acids, as is known in the art.

Table 1

            Adenovirus Genome
A-Repeat    Base Pair Nos.      Nucleotide Sequence

I           241-248             TAG TAAATTTG GGC [SEQ ID NO: 4]
II          262-269             AGT AAGATTTG GCC [SEQ ID NO: 5]
III         304-311             AGT GAAATCTG AAT [SEQ ID NO: 6]
IV          314-321             GAA TAATTTTG TGT [SEQ ID NO: 7]
V           339-346             CGT AATATTTG TCT [SEQ ID NO: 8]

Consensus   5' (A/T)AN(A/T)TTTG 3' [SEQ ID NO: 9]
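
The consensus of Table 1 can be expressed as a simple pattern and
compared against the five listed A-repeats. The following Python sketch
is illustrative only: it assumes that N stands for any of A, C, G or T,
it uses the sequences as transcribed in Table 1 (with spaces removed),
and the names are hypothetical.

```python
import re

# A-repeat sequences as transcribed in Table 1, spaces removed.
A_REPEATS = {
    "I": "TAGTAAATTTGGGC",
    "II": "AGTAAGATTTGGCC",
    "III": "AGTGAAATCTGAAT",
    "IV": "GAATAATTTTGTGT",
    "V": "CGTAATATTTGTCT",
}

# Consensus 5' (A/T)AN(A/T)TTTG 3'; assumption: N is any of A, C, G, T.
CONSENSUS = re.compile(r"[AT]A[ACGT][AT]TTTG")


def consensus_hits(repeats):
    """Map each repeat name to the first strict consensus match, or None."""
    hits = {}
    for name, sequence in repeats.items():
        match = CONSENSUS.search(sequence)
        hits[name] = match.group(0) if match else None
    return hits


if __name__ == "__main__":
    for name, hit in consensus_hits(A_REPEATS).items():
        print(name, hit)
```

With the sequences as transcribed here, a strict match is found for
repeats I, II, IV and V. Repeat III (GAAATCTG) differs from the pattern
at two positions, which is in keeping with the consensus being described
as based on similarities among the five sequences rather than on exact
identity.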

According to this invention, mutations or
deletions may be made to one or more of these PAC
sequences to generate desirable crippled helper viruses.
A deletion analysis of the packaging domain revealed a
positive correlation between encapsidation efficiency and
the number of packaging A-repeats that were present at
the 5' end of the genome. Modifications of this domain
may include 5' adenovirus sequences which contain fewer
than all five of the PAC sequences of Table 1. For
example, only two PAC sequences may be present in the
crippled virus, e.g., PAC I and PAC II, PAC III and PAC
IV, and so on. Deletions of selected PAC sequences may
involve deletion of contiguous or non-contiguous
sequences. For example, PAC II and PAC IV may be
deleted, leaving PAC I, III and V in the 5' sequence.
Still an alternative modification may be the replacement
of one or more of the native PAC sequences with one or
more repeats of the consensus sequence of Table 1.
Alternatively, this adenovirus region may be modified by
deliberately inserted mutations which disrupt one or more
of the native PAC sequences. One of skill in the art may
further manipulate the PAC sequences to similarly achieve
the effect of reducing the helper virus packaging
efficiency to a desired level.

Exemplary helper viruses which involve the manipulation of
the PAC sequences described above are disclosed in Example 7
below. Briefly, as described in that example, one helper
virus contains, in place of the native 5' ITR region
(adenovirus genome bp 1-360), a 5' adenovirus sequence
spanning adenovirus genome bp 1-269, which contains only the
5' ITR and PAC I and PAC II sequences, and deletes the
adenovirus region bp 270-360.

Another PAC sequence modified helper virus contains only the
5' Ad5 sequence of the ITR and PAC I through PAC IV (Ad bp
1-321), deleting PAC V and other sequences in the Ad region
bp 322-360.
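
The relationship between a 5' truncation endpoint and the A-repeats it retains can be checked against the Table 1 coordinates. The following is a minimal sketch under the assumption that a repeat counts as retained only when its whole eight-nucleotide span lies within bp 1 to the endpoint; `PAC_COORDS` and `retained_pacs` are hypothetical names used for illustration.

```python
# Genome coordinates (bp) of the five A-repeats, as listed in Table 1.
PAC_COORDS = {
    "I": (241, 248),
    "II": (262, 269),
    "III": (304, 311),
    "IV": (314, 321),
    "V": (339, 346),
}


def retained_pacs(last_bp):
    """List the A-repeats lying entirely within a 5' fragment spanning bp 1..last_bp."""
    return [name for name, (start, end) in PAC_COORDS.items() if end <= last_bp]


if __name__ == "__main__":
    for endpoint in (269, 321, 360):
        print(endpoint, retained_pacs(endpoint))
```

Under these assumptions, an endpoint of bp 269 retains PAC I and II, and an endpoint of bp 321 retains PAC I through IV, in agreement with the two exemplary helper viruses described above.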

These modified helper viruses are characterized by reduced
efficiency of helper virus encapsidation. These helper
viruses, with the specific modifications of the sequences
related to packaging efficiency, provide packaging efficiency
high enough for generating production lots of the helper
virus, yet low enough that they permit the achievement of
higher yields of AdA transducing viral particles according to
this invention.

Helper viruses useful in this invention, whether or not they
contain the "crippling" modifications described above,
contain selected adenovirus gene sequences depending upon the
cell line which is transfected by the helper virus and
shuttle vector. A preferred helper virus contains a variety
of adenovirus genes in addition to the modified sequences
described above.

As one example, if the cell line employed to produce the
recombinant virus is not a packaging cell line, the helper
virus may be a wild type Ad virus. Thus, the helper virus
supplies the necessary adenovirus early genes E1, E2, E4 and
all remaining late, intermediate, structural and
non-structural genes of the adenovirus genome. This helper
virus may be a crippled helper virus by incorporating
modifications in its native 5' packaging/enhancer domain.

A desirable helper virus is replication defective and lacks
all or a sufficient portion of the adenoviral immediate early
gene E1a (which spans mu 1.3 to 4.5) and delayed early gene
E1b (which spans mu 4.6 to 11.2) so as to eliminate their
normal biological functions. Such replication deficient
viruses may also have crippling modifications in the
packaging/enhancer domain. Because of the difficulty
surrounding the absolute removal of adenovirus from AdA
preparations that have been enriched by CsCl buoyant density
centrifugation, the use of a replication defective adenovirus
helper prevents the introduction of infectious adenovirus for
in vivo animal studies. This helper virus is employed with a
packaging cell line which supplies the deficient E1 proteins,
such as the 293 cell line.

Additionally, all or a portion of the adenovirus delayed
early gene E3 (which spans mu [...] to 86.2) may be
eliminated from the adenovirus sequence which forms a part of
the helper viruses useful in this invention. [...] function
of [...] for the [...] in the presence of other packaging
[...]. Such additionally deleted helper viruses may also
contain crippling modifications as described above.






[...] the helper virus to contain a reporter minigene, in
which the reporter gene is desirably different from the
reporter transgene contained in the shuttle vector. A number
of such reporter genes are known, [...]. [...] recombinant
[...] virus and of [...] enables residual quantities of
contaminating adenovirus to be monitored independent of
recombinant LacZ expressed by a pAdA shuttle vector or an AdA
virus.

Still another method for reducing the contamination of helper
virus involves the formation of poly-cation helper virus
conjugates, which may be associated with a plasmid containing
other adenoviral genes which are not present in the helper
virus. The helper viruses described above may be further
modified by resort to adenovirus-polylysine conjugate
technology. See, e.g., Wu et al, J. Biol. Chem.,
264:16985-16987 (1989); and K. J. Fisher and J. M. Wilson,
Biochem. J., 299:49 (April 1, 1994), incorporated herein by
reference.

Using this technology, a helper virus containing preferably
the late adenoviral genes is modified by the addition of a
poly-cation sequence distributed around the capsid of the
helper virus. Preferably, the poly-cation is poly-lysine,
which attaches around the negatively-charged vector to form
an external positive charge. A plasmid is then designed to
express those adenoviral genes not present in the helper
virus, e.g., the E1, E2 and/or E4 genes. The plasmid
associates to the helper virus-conjugate through the charges
on the poly-lysine sequence. This modification is also
desirably made to a crippled helper virus of this invention.
This conjugate (also termed a trans-infection particle)
permits additional adenovirus genes to be removed from the
helper virus and be present on a plasmid which does not
become incorporated into the virus during production of the
recombinant viral vector. Thus, the impact of contamination
is considerably lessened.

III. Assembly of Shuttle Vector, Helper Virus and
Production of Recombinant Virus

The material from which the sequences used in the pAdA
shuttle vector and the helper viruses are derived, as well as
the various vector components and sequences employed in the
construction of the shuttle vectors, helper viruses, and AdA
viruses of this invention, are obtained from commercial or
academic sources based on previously published and described
materials. These materials may also be obtained from an
individual patient or generated and selected using standard
recombinant molecular cloning techniques known and practiced
by those skilled in the art. Any modification of existing
nucleic acid sequences forming the vectors and viruses,
including sequence deletions, insertions, and other
mutations, is also generated using standard techniques.

Assembly of the selected DNA sequences of the adenovirus, and
the reporter genes or therapeutic genes and other vector
elements into the pAdA shuttle vector using conventional
techniques is described in Example 1 below. Such techniques
include conventional cloning techniques of cDNA such as those
described in texts [Sambrook et al, cited above], use of
overlapping oligonucleotide sequences of [...], polymerase
chain reaction, and any suitable method which provides the
desired nucleotide sequence. Standard transfection and
co-transfection techniques are employed, e.g., CaPO4
transfection techniques using the HEK 293 cell line. Other
conventional methods employed in this invention include
homologous recombination of the viral genomes, plaquing of
viruses in agar overlay, methods of measuring signal
generation and the like. Assembly of any desired AdA vector
or helper virus of this invention is within the skill of the
art, based on the teachings of this invention.

As described in detail in Example 1 below and with resort to
Fig. 2A and the DNA sequence of the plasmid reported in Fig.
3, a unique pAdA shuttle vector of this invention,
pAdA.CMVLacZ, is generated. pAdA.CMVLacZ contains Ad5
sequences encoding the 5' terminal followed by a CMV
promoter/enhancer, a splice donor/splice acceptor sequence, a
bacterial beta-galactosidase gene (LacZ), a SV-40 poly A
sequence (pA), a 3' ITR from Ad5 and remaining plasmid
sequence from plasmid pSP72 (Promega) backbone.

To generate the AdA genome which is incorporated in the
vector, the plasmid pAdA.CMVLacZ must be digested with EcoRI
to release the AdA.CMVLacZ genome, freeing the adenovirus
ITRs and making them available targets for replication. Thus
production of the vector is "restriction-dependent", i.e.,
requires restriction endonuclease rescue of the replication
template. See Fig. 2B.
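
The idea of releasing an insert flanked by two EcoRI sites from a circular plasmid can be sketched in a few lines. This is a toy model only: the sequence below is invented for illustration and is not the pAdA.CMVLacZ sequence, only the top strand is tracked, and sticky ends are ignored. EcoRI recognizes GAATTC and cuts between G and AATTC; the function names are hypothetical.

```python
ECORI_SITE = "GAATTC"
CUT_OFFSET = 1  # EcoRI cuts the top strand between G and AATTC


def cut_positions(circular_seq, site=ECORI_SITE, offset=CUT_OFFSET):
    """Return sorted top-strand cut positions on a circular sequence."""
    n = len(circular_seq)
    # Append the first few bases so that a site spanning the origin is found.
    extended = circular_seq + circular_seq[: len(site) - 1]
    return sorted(
        (i + offset) % n
        for i in range(n)
        if extended[i : i + len(site)] == site
    )


def digest_circular(circular_seq):
    """Return the linear fragments produced by cutting a circular sequence."""
    cuts = cut_positions(circular_seq)
    if not cuts:
        return [circular_seq]  # no site: the circle stays intact
    n = len(circular_seq)
    doubled = circular_seq + circular_seq
    fragments = []
    for k, start in enumerate(cuts):
        end = cuts[(k + 1) % len(cuts)]
        if end <= start:
            end += n  # wrap around the origin
        fragments.append(doubled[start:end])
    return fragments


if __name__ == "__main__":
    toy_genome = "CATCATCATC"   # stands in for ITR-minigene-ITR (hypothetical)
    toy_backbone = "TGTGTGTG"   # stands in for the plasmid backbone (hypothetical)
    plasmid = ECORI_SITE + toy_genome + ECORI_SITE + toy_backbone
    for fragment in digest_circular(plasmid):
        print(len(fragment), fragment)
```

With two flanking sites the circle resolves into two linear pieces, one carrying the toy genome and one carrying the toy backbone, which mirrors the restriction-dependent rescue described above.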

A second type of pAdA plasmid was designed which places the
3' Ad terminal sequence in a head-to-tail arrangement
relative to the 5' terminal sequence. As described in Example
1 and Fig. 4A, and with resort to the DNA sequence of the
plasmid reported in Fig. 5, a second unique AdA vector
sequence of this invention, AdAc.CMVLacZ, is generated from
the shuttle plasmid pAdAc.CMVLacZ, which contains an Ad5 5'
ITR sequence and 3' ITR sequence positioned head-to-tail,
followed by a CMV enhancer/promoter, SD/SA sequence, LacZ
gene and pA sequence in a plasmid pSP72 (Promega) backbone.
As described in Example 1B, this "restriction-independent"
plasmid permits the AdA genome to be replicated and rescued
from the plasmid backbone without including an endonuclease
treatment (see Fig. 4B).

B. Helper Virus

As described in detail in Example 2, an exemplary
conventional E1 deleted adenovirus helper virus is virus
Ad.CBhpAP, which contains a 5' adenovirus sequence from mu
0-1, a reporter minigene containing human placenta alkaline
phosphatase (hpAP) under the transcriptional control of the
chicken β-actin promoter, followed by a poly-A sequence from
SV40, followed by adenovirus sequences from 9.2 to 78.4 and
86 to 100. This helper contained deletions from mu 1.0 to 9.2
and 78.4 to 86, which eliminate substantially the E1 region
and the E3 region of the virus. This virus may be desirably
crippled according to this invention by modifications to its
packaging enhancer domain.

Exemplary crippled helper viruses of this invention are
described using the techniques described in Example 7 and
contain the modified 5' PAC sequences, i.e., [...] m.u.
0-0.75 or adenovirus genome bp 1-321 [...]. Briefly, the
[...] sequences [...] conventional techniques into a [...]
adenovirus based plasmid [...] the reporter minigene is
followed [...].

[...] incorporation of a poly-L-lysine [...] in Example 5
below and Fig. 10 by [...] to the Ad.CBhpAP virion [...]. The
same procedure may be employed with the PAC sequence modified
helper viruses of this invention.



[...] shuttle vector in the presence of helper virus and/or a
[...] permits the adenovirus-transgene sequences in the
shuttle vector to be replicated and packaged into virion
capsids, resulting in the recombinant AdA virus. The current
method for producing such AdA virus is transfection-based and
described in detail in Example 3. [...] virus is used to
[...] line human HEK 293, [...] subsequently transfected with
a pAdA shuttle vector containing a selected transgene by
conventional methods. About [...] or more hours
post-transfection, the cells are harvested, and an extract
prepared. The AdA [...] is packaged into virions that
sediment at a lower density than the helper virus in cesium
gradients. Thus, the recombinant AdA virus containing a
selected transgene is separated from the bulk of the helper
virus by purification via buoyant density ultracentrifugation
in a CsCl gradient.

The yield of AdA transducing virus is largely dependent on
the number of cells that are transfected with the pAdA
shuttle plasmid, making it desirable to use a transfection
protocol with high efficiency. One such method involves use
of a poly-L-lysinylated helper adenovirus as described above.
A pAdA shuttle plasmid containing the desired transgene under
the control of a suitable promoter, as described above, is
then complexed directly to the positively charged helper
virus capsid, resulting in the formation of a single
transfection particle containing the pAdA shuttle vector and
the helper functions of the helper virus.

The underlying principle is that the helper adenovirus coated
with plasmid pAdA DNA will co-transport the attached nucleic
acid across the cell membrane and into the cytoplasm
according to its normal mechanism of cell entry. Therefore,
the poly-L-lysine modified helper adenovirus assumes multiple
roles in the context of an AdA-based complex. First, it is
the structural foundation upon which plasmid DNA can bind,
increasing the effective concentration. Second, receptor
mediated endocytosis of the virus provides the vehicle for
cell uptake of the plasmid DNA. Third, the endosomolytic
activity associated with adenoviral infection facilitates the
release of internalized plasmid into the cytoplasm. Fourth,
the adenovirus contributes trans helper functions on which
the recombinant AdA virus is dependent for replication and
packaging of transducing viral particles.

The Ad-based transfection procedure using a pAdA shuttle
vector and a polycation-helper conjugate is described in
Example 6. Additionally, as described previously, the helper
virus-plasmid conjugate may be another form of helper virus
delivery of the [...] between a plasmid and the helper virus,
thus reducing the self-replication efficiency of the helper
virus.

A presently preferred method of producing the recombinant AdA
virus of this invention involves performing the
above-described transfection with the crippled helper virus
or crippled helper virus conjugate, as described above. A
"crippled" helper virus of this invention is unable to
package itself [...] and therefore permits ready separation
of the helper virus from the newly packaged AdA vector of
this invention by use of buoyant density ultracentrifugation
in a CsCl gradient, as described in the examples below.

IV. Function of the Recombinant AdA Virus

Once the AdA virus of this invention is produced by
cooperation of the shuttle vector and helper virus, the AdA
virus can be targeted to, and taken up by, a selected target
cell. The selection of the target cell also depends upon the
use of the recombinant virus, i.e., whether or not the
transgene is to be replicated in vitro or ex vivo for
production in a [...] cell type [...]. Target cells may be
any [...] cell (preferably a human cell). [...] in vivo [...]
neurons, hepatocytes, epithelial cells and the like. The
helper adenovirus sequences supply the sequences necessary to
permit uptake of the virus by the AdA.

Once the recombinant virus is taken up by a cell, the
adenovirus flanked transgene is rescued from the parental
adenovirus backbone by the machinery of the infected cell, as
with other recombinant adenoviruses. Once uncoupled (rescued)
from the genome of the AdA virus, the recombinant minigene
seeks an integration site in the host chromatin and becomes
integrated therein, either transiently or stably, providing
expression of the accompanying transgene in the host cell.

V. Use of the AdA Viruses in Gene Therapy 

The novel recombinant viruses and viral conjugates of this
invention provide efficient gene transfer vehicles for
somatic gene therapy. These viruses are prepared to contain a
therapeutic gene in place of the LacZ reporter transgene
illustrated in the exemplary viruses and vectors. By use of
the AdA viruses containing therapeutic transgenes, these
transgenes can be delivered to a patient in vivo or ex vivo
to provide for integration of the desired gene into a target
cell. Thus, these viruses can be employed to correct genetic
deficiencies or defects. An example of the generation of an
AdA gene transfer vehicle for the treatment of cystic
fibrosis is described in Example 4 below. One of skill in the
art can generate any number of other gene transfer vehicles
by including a selected transgene for the treatment of other
disorders.

The recombinant viruses of the present invention may be
administered to a patient, preferably suspended in a
biologically compatible solution or pharmaceutically
acceptable delivery vehicle. A suitable vehicle includes
sterile saline. Other aqueous and non-aqueous isotonic
sterile injection solutions, and aqueous and non-aqueous
sterile suspensions known to be pharmaceutically acceptable
carriers and well known to those of skill in the art may be
employed for this purpose.

The recombinant viruses of this invention may be administered
in sufficient amounts to transfect the desired cells and
provide sufficient levels of integration and expression of
the selected transgene to provide a therapeutic benefit
without undue adverse effects or with medically acceptable
physiological effects which can be determined by those
skilled in the medical arts. Conventional and
pharmaceutically [...] direct delivery to the target [...],
intranasal, intravenous, intramuscular, [...] intradermal and
oral administration. Routes of administration may be
combined [...].

Dosages of the recombinant virus will depend primarily on
factors such as the condition being treated, the selected
gene, the age, weight and health of the patient, and may thus
vary among patients. A therapeutically effective human dosage
of the viruses of the present invention is believed to be in
the range of from about 20 to about 50 ml of [...] solution
containing concentrations of from about [...] 10^10 pfu/ml
virus of the present invention. A preferred human dosage is
about 20 ml [...] at the above concentrations. The dosage
will be adjusted to balance the therapeutic benefit against
any side effects. The levels of expression of the selected
gene can be monitored to determine the selection, adjustment
or frequency of dosage administration.

The following examples illustrate the construction of the
pAdA shuttle vectors, helper viruses and recombinant AdA
viruses of the present invention and the use thereof in gene
therapy. These examples are illustrative only, and do not
limit the scope of the present invention.

Example 1 - Production of pAdA.CMVLacZ and pAdAc.CMVLacZ
Shuttle Vectors

A. pAdA.CMVLacZ

A human adenovirus Ad5 sequence was modified to contain a
deletion in the E1a region [map units 1 to 9.2], which
immediately follows the Ad 5' region (bp 1-360) (illustrated
in Fig. 1A). Thus, the plasmid contains the 5' ITR sequence
(bp 1-103), the native packaging/enhancer sequences and the
TATA box for the E1a region (bp 104-360). A minigene
containing the CMV immediate early enhancer/promoter, an
SD/SA sequence, a cytoplasmic lacZ gene, and SV40 poly A
(pA), was introduced at the site of the E1a deletion. This
construct was further modified so that the minigene is
followed by the 3' ITR sequences (bp 35,353-end). The DNA
sequences for these components are provided in Fig. 3 and SEQ
ID NO: 1 (see also the brief description of this figure).

This construct was then cloned by conventional techniques
into a pSP72 vector (Promega) backbone to make the circular
shuttle vector pAdA.CMVLacZ. See the schematic of Fig. 2A.
This construct was engineered with EcoRI sites flanking the
5' and 3' Ad5 ITR sequences. pAdA.CMVLacZ was then subjected
to enzymatic digestion with EcoRI, releasing a linear
fragment of the vector spanning the terminal end of the Ad 5'
ITR sequence through the terminal end of the 3' ITR sequence
from the plasmid backbone. See Fig. 2B.

B. pAdAc.CMVLacZ

The shuttle vector pAdAc.CMVLacZ (Figs. 4A and 5) was
constructed using a pSP72 (Promega) backbone so that the Ad5
5' ITR and 3' ITR were positioned head-to-tail. The
organization of the Ad5 ITRs was based on reports that
suggest circular Ad genomes that have the terminal ends fused
together head-to-tail are infectious to levels comparable to
linear Ad genomes. A minigene encoding the CMV enhancer,
[...] gene, and the poly A sequence was inserted [...]
following the 5' ITR. The DNA sequence of the resulting
plasmid and the sequences for the individual components are
reported in Fig. 5 and SEQ ID NO: 2 (see also the brief
description of Fig. 5). This plasmid does not require
enzymatic digestion prior to its use to produce the viral
particle (see [...]). This vector was designed to enable
restriction-independent production of LacZ AdA vectors.



The Ad.CBhpAP helper virus [...] deficient adenovirus
containing an [...] alkaline phosphatase minigene. Its
construction involved conventional [...] and homologous
recombination [...]. [...] DNA substrate was extracted from
CsCl purified dl7001 virions, an Ad5 (serotype subgroup C)
variant that carries a 3 kb deletion between mu 78.4 and 86
in the nonessential E3 region ([...]). Viral DNA was prepared
[...] (adenovirus genomic bp position [...]) which removes
the left arm of the genome encompassing adenovirus map units
0-2.5. See lower diagram of Fig. 1B.

A parental cloning vector, pAd.BglII, was designed. It
contains two segments of wild-type Ad5 genome (i.e., map
units 0-1 and 9-16.1) separated by a unique BglII cloning
site for insertion of heterologous sequences. The missing Ad5
sequences between the two domains (adenovirus genome bp
361-3327) result in the deletion of E1a and the majority of
E1b following recombination with viral DNA.
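
The text uses both map units (mu) and base-pair coordinates for the same regions. A rough conversion can be sketched as follows, assuming about 360 bp per map unit; that figure is inferred here from the pairing of bp 1-360 with mu 0-1 in this description and is an approximation, not a stated constant. `BP_PER_MU`, `mu_to_bp` and `bp_to_mu` are hypothetical names.

```python
# Assumption: 1 map unit is about 360 bp, inferred from the text's pairing
# of adenovirus genome bp 1-360 with map units 0-1.
BP_PER_MU = 360


def mu_to_bp(mu):
    """Approximate base-pair coordinate for a map-unit position."""
    return round(mu * BP_PER_MU)


def bp_to_mu(bp):
    """Approximate map-unit position for a base-pair coordinate."""
    return bp / BP_PER_MU


if __name__ == "__main__":
    # The deleted interval bp 361-3327 expressed in map units.
    print(round(bp_to_mu(361), 2), round(bp_to_mu(3327), 2))
    # Boundaries of the retained segments, mu 1 and mu 9.2, in bp.
    print(mu_to_bp(1), mu_to_bp(9.2))
```

Under this assumption, bp 3327 falls at about mu 9.2, which is consistent with the mu 1.0 to 9.2 deletion attributed to the helper virus elsewhere in this description.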

A recombinant hpAP minigene was designed and inserted into
the BglII site of pAd.BglII to generate the complementing
plasmid, pAdCBhpAP. The linear arrangement of this minigene
includes:

(a) the chicken cytoplasmic β-actin promoter [nucleotides +1
to +275 as described in T. A. Kost et al, Nucl. Acids Res.,
11(23):8287 (1983); nucleotides 9241-8684 of Fig. 7];

(b) an SV40 intron (e.g., nucleotides 1579-1711 of SEQ ID NO:
2),

(c) the sequence for human placental alkaline phosphatase
(available from Genbank) and

(d) an SV40 polyadenylation signal (a 237 Bam HI-BclI
restriction fragment containing the cleavage/poly-A signals
from both the early and late transcription units; e.g.,
nucleotides 837-639 of SEQ ID NO: 1).

The resulting complementing plasmid, pAdCBhpAP, contained a
single copy of recombinant hpAP minigene flanked by
adenovirus coordinates 0-1 on one side and 9.2-16.1 on the
other.

Plasmid DNA was linearized using a unique NheI site
immediately 5' to adenovirus map unit zero (0), and the
above-identified adenovirus substrate and the complementing
plasmid DNAs were transfected to 293 cells [ATCC CRL1573]
using a standard calcium phosphate transfection procedure
[see, e.g., Sambrook et al, cited above]. The end result of
homologous recombination involving sequences that map to
adenovirus map units [...] is hybrid Ad.CBhpAP helper virus
which contains adenovirus map units 0-1 and, in place of the
E1a and E1b coding regions from the dl7001 adenovirus
substrate, the hpAP minigene from the plasmid, followed by Ad
sequences 9 to 100, with a deletion in the E3 (78.4-86 mu)
regions.

Example 3 - Production of Recombinant AdA Virus

Recombinant AdA viruses of this invention are generated by
co-transfection of a shuttle vector with the helper virus in
a selected packaging or non-packaging cell line.

As described in detail below, the linear fragment provided in
Example 1A, or the circular AdA genome carrying the LacZ of
Example 1B, is packaged into the [...] helper virus (Example
2) using conventional techniques, which provides an empty
capsid head, as illustrated in Fig. 1C. Those virus particles
which have successfully taken up the pAdA shuttle genome into
the capsid head can be distinguished from those containing
the hpAP gene by virtue of the differential expression of
[...].

In more detail, 293 cells ([...]) in [...] dish were seeded
and infected with helper virus Ad.CBhpAP (produced as
described in Example 2) at an MOI of 5 in 20 ml DMEM/2% fetal
bovine serum (FBS). This helper specific marker is critical
for monitoring the level of helper virus contamination in AdA
preparations [...] purification. The helper virus provides
[...] helper functions for synthesis and [...].

Two hours post-infection, using either the
restriction-dependent shuttle vector or the
restriction-independent shuttle vector, plasmid pAdA.CMVLacZ
(digested with EcoRI) or pAdAc.CMVLacZ DNA, each carrying a
LacZ minigene, was added to the cells by a calcium phosphate
precipitate (2.5 ml calcium phosphate transfection cocktail
containing 50 μg plasmid DNA).

Thirty to forty hours post-transfection, cells were
harvested, suspended in 10 mM Tris-Cl (pH 8.0) (0.5 ml/150 mm
plate) and frozen at -80°C. Frozen cell suspensions were
subjected to three rounds of freeze (ethanol-dry ice)-thaw
(37°C) cycles to release virion capsids. Cell debris was
removed by centrifugation (5,000 x g for 10 minutes) and the
clarified supernatant applied to a CsCl gradient to separate
recombinant virus from helper virus as follows.

Supernatants (10 ml) applied to the discontinuous CsCl
gradient (composed of equal volumes of CsCl at 1.2 g/ml, 1.36
g/ml, and 1.45 g/ml in 10 mM Tris-Cl (pH 8.0)) were
centrifuged for 8 hours at 72,128 x g, resulting in
separation of infectious helper virus from incompletely
formed virions. Fractions were collected from the interfacing
zone between the helper and top components and analyzed by
Southern blot hybridization or for the presence of LacZ
transducing particles. For functional analysis, aliquots (2.0
ml from each sample) from the same fractions were added to
monolayers of 293 cells (in 35 mm wells) and expression of
recombinant β-galactosidase determined 24 hours later. More
specifically, monolayers were harvested, suspended in 0.3 ml
10 mM Tris-Cl (pH 8.0) buffer and an extract prepared by
three rounds of freeze-thaw cycles. Cell debris was removed
by centrifugation and the supernatant tested for
β-galactosidase (LacZ) activity according to the procedure
described in J. Price et al, Proc. Natl. Acad. Sci., USA,
84:156-160 (1987). The specific activity (milliunits
β-galactosidase/mg protein) of reporter enzyme was measured
from indicator cells. For the recombinant virus, specific
activity was 116.

Fractions with β-galactosidase activity from the
discontinuous gradient were sedimented through an equilibrium
cesium gradient to further enrich the preparation for AdA
virus. A linear gradient was generated in the area of the
recombinant virus spanning densities 1.29 to 1.34 g/ml. A
sharp peak of recombinant virus, detected as the appearance
of the β-galactosidase activity in infected 293 cells, eluted
between 1.31 and [...] g/ml. This peak of recombinant virus
was located between two major A260 absorbing peaks and in an
area of the gradient where the helper virus was precipitously
dropping off. The equilibrium sedimentation gradient
accomplished another 10^2 to 10^3 fold purification of
recombinant virus from helper virus. The yield of recombinant
[...] virus from a 50 plate prep after 2 sedimentations
ranged from 10^7 to [...] transducing particles.

Analysis of lysates of cells transfected with the recombinant
vector and infected with helper revealed virions capable of
transducing the recombinant minigene contained within the
vector. Subjecting aliquots of the fractions to Southern
analysis using probes specific to the recombinant virus or
helper virus revealed packaging of multiple molecular forms
of vector derived sequences. The predominant form of the
deleted viral genome was the size ([...] kb) of the
corresponding double stranded DNA monomer (AdA.CMVLacZ), with
less abundant but discrete higher molecular weight species
(10 kb and [...]) also present. Full-length helper virus is
35 kb. Importantly, the peak of transduction activity
corresponds with the highest molecular weight form of the
deleted virus. These results confirm the hypothesis that ITRs
and contiguous packaging sequence are the only elements
necessary for incorporation into virions. An apparently
ordered or preferred rearrangement of the recombinant Ad
monomer genome leads to a more biologically active molecule.
The fact that larger molecular species of the deleted genome
are 2x and 3x fold larger than the monomer deleted virus
genome suggests that the rearrangements may involve
sequential duplication of the original genome.

These same procedures may be adapted for production of a
recombinant AdA virus using a crippled helper virus or helper
virus conjugate as described previously.

Example 4 - Recombinant AdA Virus Containing a Therapeutic
Minigene

To test the versatility of the recombinant AdA virus system,
the reporter LacZ minigene obtained from pAdA.CMVLacZ was
cassette-replaced with a therapeutic minigene encoding CFTR.

The minigene contained human CFTR cDNA [Riordan et al,
Science, 245:1066-1073 (1989); nucleotides 8622-4065 of SEQ
ID NO: 3] under the transcriptional control of a chimeric CMV
enhancer/chicken β-actin promoter element (nucleotides +1 to
+275 as described in T. A. Kost et al, Nucl. Acids Res.,
11(23):8287 (1983); nucleotides 9241-8684 of SEQ ID NO: 3,
Fig. 7); and followed by an SV-40 poly-A sequence
(nucleotides 3887-3684 of SEQ ID NO: 3, Fig. 7).

The CFTR minigene was inserted into the E1 deletion site of
an Ad5 virus (called pAd.ElA) which contains a deletion in
E1a from mu 1-9.2 and a deletion in E3 from mu 78.4-86.

The resulting shuttle vector called pAdA.CBCFTR (see Fig. 6
and the DNA sequence of Fig. 7 [SEQ ID NO: 3]) used the same
Ad ITRs of pAdA.CMVLacZ, but the Ad5 sequences terminated
with NheI sites instead of EcoRI. Therefore release of the
minigene from the plasmid was accomplished by digestion with
NheI.

The vector production system described in Example 3 was
employed, using the helper virus [...] 2. Monolayers of 293
cells grown to 80-90% confluency in 150 mm culture dishes
were infected with the helper virus at an MOI of 5.
Infections were done in DMEM supplemented with 2% FBS at 20
ml media/150 mm plate. Two hours post-infection, 50 μg [...]
transfection cocktail was added to each plate and evenly
[...].

Delivery of the pAdA.CBCFTR plasmid to 293 cells was mediated
by formation of a calcium phosphate precipitate and the [...]
virus resolved from Ad.CBhpAP helper virus by CsCl buoyant
density ultracentrifugation as follows.

Cells were left in this condition for 10-14 h, after which
the infection/transfection [...] was replaced with 20 ml
fresh DMEM/2% FBS. Approximately 20 h post-transfection,
cells were harvested, suspended in 10 mM Tris-Cl (pH 8.0)
buffer (0.5 ml/150 mm plate), and stored at -80°C.

Frozen cell suspensions were lysed by three sequential rounds
of freeze (ethanol-dry ice)-thaw (37°C). Cell debris was
removed by centrifugation (5,000 x g for 10 min) and 10 ml
[...] Tris-Cl (pH 8.0) buffer. Centrifugation was performed
at 20,000 rpm in a Beckman SW-20 rotor for 8 h at 4°C.
Fractions (1.0 ml) were collected from the bottom of the
centrifuge tube and analyzed for [...] transducing vectors.
Peak fractions were combined and banded [...] equilibrium.
[...] were dialyzed against 20 mM HEPES (pH [...]) [...]
(HBS) and stored frozen at -80°C in the presence of 10%
glycerol or as a liquid stock at -20°C (HBS + 40% glycerol).

Fractions collected after ultracentrifugation were
analyzed for transgene expression and vector DNA. For
lacZ ArAd vectors, 2 µl aliquots were added to 293 cell
monolayers seeded in 35 mm culture wells. Twenty-four
hours later cells were harvested, suspended in 0.3 ml 10
mM Tris-Cl (pH 8.0) buffer, and lysed by three rounds of
freeze-thaw. Cell debris was removed by centrifugation
(15,000 x g for 10 min) and the supernatant was assayed
for total protein [Bradford, (1976)] and β-galactosidase
activity [Sambrook et al, (1989)] using ONPG
(o-Nitrophenyl β-D-galactopyranoside) as substrate.

Expression of CFTR protein from the AdA.CBCFTR
vector was determined by immunofluorescence localization.
Aliquots of AdA.CBCFTR, enriched by two rounds of
ultracentrifugation and exchanged to HBS storage buffer,
were added to primary cultures of airway epithelial cells
obtained from the lungs of CF transplant recipients.
Twenty-four hours after the addition of vector, cells
were harvested and affixed to glass slides using
centrifugal force (Cytospin 3, Shandon Scientific
Limited). Cells were fixed with freshly prepared 3%
paraformaldehyde in PBS (1.4 mM KH2PO4, 4.3 mM Na2HPO4,
2.7 mM KCl, and 137 mM NaCl) for 15 min at room
temperature (RT), washed twice in PBS, and permeabilized
with 0.05% NP-40 for 10 min at RT. The
immunofluorescence procedure began with a blocking step
in 10% goat serum (PBS/GS) for 1 h at RT, followed by
binding of the primary monoclonal mouse anti-human CFTR
(R-domain specific) antibody (Genzyme) diluted 1:500 in
PBS/GS for 2 h at RT. Cells were washed extensively in
PBS/GS and incubated for 1 h at RT with a donkey
anti-mouse IgG (H+L) FITC conjugated antibody (Jackson
ImmunoResearch Laboratories) diluted 1:100.

For analysis of vector DNA, 5 µl aliquots
were taken directly from CsCl fractions and incubated
with 20 µl capsid digestion buffer (50 mM Tris-Cl, pH
8.0; 1.0 mM EDTA, pH 8.0; 0.5% SDS, and 1.0 mg/ml
Proteinase K) at 50°C for 1 h. The reactions were
allowed to cool to RT, loading dye was added, and the
samples were electrophoresed through a 1.2% agarose gel.
Resolved DNAs were electroblotted onto a nylon membrane
(Hybond-N) and hybridized with a 32-P labeled restriction
fragment. Blots were analyzed by autoradiography or
scanned on a PhosphorImager 445 SI (Molecular Dynamics).

The results that were obtained from Southern blot
analysis of gradient fractions revealed a distinct viral
band that migrated faster than the helper virus DNA.
The highest viral titers mapped to fractions 3 and 4.
Quantitation of the bands in fraction 4 indicated the
titer of Ad.CBhpAP was approximately 1.5x greater than
AdACBCFTR. However, if the size difference between the
two viruses is factored in (Ad.CBhpAP = [...];
AdACBCFTR = 6.2 kb), the viral titer (where 1 particle =
1 DNA molecule) of AdACB.CFTR is at least 4-fold greater
than the viral titer of Ad.CBhpAP.

While Southern blot analysis of gradient fractions
was useful for showing the production of AdA viral
particles, it also demonstrated the utility of
ultracentrifugation for purifying AdA viruses.
Considering the latter of these, both LacZ and CFTR
transducing viruses banded in CsCl to an intermediate
density between infectious adenovirus helper virions
(1.34 g/ml) and incompletely formed capsids [...].
The lighter density relative to helper virus likely
results from the smaller genome carried by the AdA
viruses. This further suggests that changes in virus size
influence the density and purification of AdA virus.
Regardless, the ability to separate AdA virus from the
helper virus is an important observation and suggests
further purification may be achieved by successive rounds
of banding through CsCl.
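
The size correction applied to the Southern blot signal above can be illustrated numerically. The following is a minimal sketch, not part of the described method: it converts a relative DNA mass signal into a relative number of genomes, assuming one genome per particle. The helper genome length is not legible in this text, so the 35 kb value below is an assumption (a typical length for an E1-deleted Ad5 genome); the function and variable names are likewise illustrative.

```python
def vector_to_helper_particle_ratio(helper_mass_signal, vector_mass_signal,
                                    helper_genome_kb, vector_genome_kb):
    """Relative particle number, assuming 1 particle = 1 DNA molecule.

    Hybridization signal scales with DNA mass, so the number of genomes
    is proportional to signal divided by genome length.
    """
    helper_genomes = helper_mass_signal / helper_genome_kb
    vector_genomes = vector_mass_signal / vector_genome_kb
    return vector_genomes / helper_genomes


# Helper signal is about 1.5x the vector signal (from the text).
# 35.0 kb is an ASSUMED helper genome length; 6.2 kb is from the text.
ratio = vector_to_helper_particle_ratio(
    helper_mass_signal=1.5, vector_mass_signal=1.0,
    helper_genome_kb=35.0, vector_genome_kb=6.2)
print(f"vector:helper particle ratio ~ {ratio:.1f}")
```

Under these assumptions the vector particles outnumber the helper particles roughly fourfold, which is of the same order as the estimate given in the text; a different helper genome length would shift the figure proportionally.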

This recombinant virus is useful in gene therapy 
alone, or preferably, in the form of a conjugate prepared 
as described herein. 

Example 5 - Correction of Genetic Defect in CF Airway
Epithelial Cells with AdACB.CFTR

Treatment of cystic fibrosis, utilizing the
recombinant virus provided above, is particularly suited
for in vivo, lung-directed, gene therapy. Airway
epithelial cells are the most desirable targets for gene
transfer because the pulmonary complications of CF are
usually its most morbid and life-limiting.

The recombinant AdACB.CFTR virus was fractionated on
sequential CsCl gradients and fractions containing CFTR
sequences, migrating between the adenovirus and top
components fractions described above, were used to infect
primary cultures of human airway epithelial cells derived
from the lungs of a CF patient. The cultures were
subsequently analyzed for expression of CFTR protein by
immunocytochemistry. Immunofluorescent detection with
mouse anti-human CFTR (R domain specific) antibody was
performed 24 hours after the addition of the recombinant
virus. Analysis of mock infected CF cells failed to
reveal significant binding to the R domain specific CFTR
antibody. Primary airway epithelium cultures exposed to
the recombinant virus demonstrated high levels of CFTR
protein in 10-20% of the cells.

Thus, the recombinant virus of the invention,
containing the CFTR gene, may be delivered directly into
the airway, e.g. by formulating the virus above into a
preparation which can be inhaled. For example, the
recombinant virus or conjugate of the invention
containing the CFTR gene is [...] sodium chloride. The
virus or conjugate is taken up by respiratory airway
cells and the gene is expressed.

Alternatively, the virus or conjugates of the
invention may be delivered by other suitable means,
including site-directed injection of the virus bearing
the CFTR gene. In the case of CFTR gene delivery,
preferred solutions for bronchial instillation are
sterile saline solutions containing in the range of from
about 1 x 10[...] to 1 x 10[...] [...] in the range of
from about 1 x 10[...] to 1 x 10[...] pfu of the virus of
the present invention.

Other suitable methods for the treatment of cystic
fibrosis by use of gene therapy recombinant viruses of
this invention may be obtained from the art discussions
of other types of gene therapy vectors for CF. See, for
example, U.S. Patent No. 5,240,846, incorporated by
reference herein.
Another version of the helper virus [...] pAdA shuttle
plasmid [...] virus capsid. This conjugate permits
efficient delivery of shuttle plasmid pAdA shuttle vector
[...] helper virus, thereby removing the need for a
separate transfection step. See, e.g., Fig. 10 for a
diagrammatic outline of this construction. Alternatively,
such a conjugate with a plasmid supplying some Ad genes
and the helper supplying the remaining [...] production
of the AdA viral vector provides a [...] way to reduce
contamination of the helper virus, as discussed above.

Purified stocks of a large-scale expansion of
Ad.CBhpAP were modified by coupling poly-L-lysine to the
virion capsid essentially as described by K. J. Fisher
and J. M. Wilson, Biochem. J., 299:49-58 (1994),
resulting in an Ad.CBhpAP-(Lys)n conjugate. The
procedure involves three steps.

First, CsCl band purified helper virus Ad.CBhpAP was
reacted with the heterobifunctional crosslinker
sulfo-SMCC [sulfo-(N-succinimidyl-4-(N-maleimidomethyl)
cyclohexane-1-carboxylate] (Pierce). The conjugation
reaction, which contained 0.5 mg (375 nmol) of
sulfo-SMCC and 6 x 10^12 A260 helper virus particles in
3.0 ml of HBS, was incubated at 30°C for 45 minutes with
constant gentle shaking. This step involved formation of
a peptide bond between the active N-hydroxysuccinimide
(NHS) ester of sulfo-SMCC and a free amine (e.g. lysine)
contributed by an adenovirus protein sequence (capsid
protein) in the vector, yielding a maleimide-activated
viral particle. The activated adenovirus is shown in
Fig. 10 having the capsid protein fiber labeled with the
nucleophilic maleimide moiety. In practice, other capsid
polypeptides including hexon and penton base are also
targeted.

Unincorporated, unreacted cross-linker was removed
by gel filtration on a 1 cm x 15 cm Bio-Gel P-6DG
(Bio-Rad Laboratories) column equilibrated with 50 mM
Tris/HCl buffer, pH 7.0, and 150 mM NaCl. Peak A260
fractions containing maleimide-activated helper virus
were combined and placed on ice.

Second, poly-L-lysine having a molecular mass of 58
kDa at 10 mg/ml in 50 mM triethanolamine buffer (pH 8.0),
150 mM NaCl and 1 mM EDTA was thiolated with
2-iminothiolane/HCl (Traut's Reagent; Pierce) to a molar
ratio of 2 moles -SH/mole polylysine under N2; the cyclic
thioimidate reacts with the poly-L-lysine [...],
resulting in a thiolated polycation. After a 45 minute
incubation at room temperature the reaction was applied
to a 1 cm x 15 cm Bio-Gel P-6DG column equilibrated with
50 mM Tris/HCl buffer (pH 7.0), 150 mM NaCl and 2 mM
[...] to remove unincorporated Traut's Reagent.

Quantification of free thiol groups was accomplished
with Ellman's reagent [5,5'-dithio-bis-(2-nitrobenzoic
acid)], revealing approximately 3-4 mol of -SH/mol of
poly(L-lysine). The coupling reaction was initiated by
adding 1 x 10[...] A260 particles of maleimide-activated
helper virus/mg of thiolated poly(L-lysine) and
incubating the mixture on ice at 4°C for 15 hours under
argon. 2-Mercaptoethylamine was added at the completion
of the reaction and incubation carried out at room
temperature for 20 minutes to block unreacted maleimide
sites.

Virus-polylysine conjugates, Ad.CBhpAP-p(Lys)n, were
purified away from unconjugated poly(L-lysine) by
ultracentrifugation [...] CsCl in 10 mM [...] buffer
(pH 8.0). Centrifugation was at 30,000 g for 2
hours at 4°C. The final product was dialyzed against 20
mM Hepes buffer (pH 7.8) containing 150 mM NaCl (HBS).

Example 7 - [...]

The formation of Ad.CBhpAP-pLys/pAdA.CMVLacZ
particle is initiated by adding 20 µg plasmid
pAdA.CMVLacZ DNAs to 1.2 x 10[...] [...] in a final
volume of 0.2 ml DMEM and allowing the complex to develop
at room temperature for [...] minutes. This ratio
typically represents the plasmid DNA binding capacity of
a standard lot of adenovirus-pLys conjugate and gives the
highest levels of plasmid transgene expression.

The resulting trans-infection particle is
transfected onto 293 cells (4 x 10^7 cells seeded on a 150
mm dish). Thirty hours after transfection, the particles
are recovered and subjected to a freeze/thaw technique to
obtain an extract. The extract is purified on a CsCl
step gradient with gradients at 1.20 g/ml, 1.36 g/ml and
1.45 g/ml. After centrifugation at 90,000 x g for 8
hours, the AdA vectors were obtained from a fraction
under the top components as identified by the presence of
LacZ, and the helper virus was obtained from a smaller,
denser fraction, as identified by the presence of hpAP.

Example 8 - Construction of Modified Helper Viruses with
Crippled Packaging (PAC) Sequences

This example refers to Figs. 9A through 9C, 10A and
10B in the design of modified helper viruses of this
invention.

Ad5 5' terminal sequences that contained PAC domains
I and II (Fig. 8A) or PAC domains I, II, III, and IV
(Fig. 8B) were generated by PCR from the wild type Ad5 5'
genome depicted in Fig. 1B using PCR clones indicated by
the arrows in Fig. 1B. The sequences of the resulting
amplification products (Figs. 8A and 8B) differed from the
wild-type Ad5 genome in the number of A-repeats carried
by the left (5') end.

As depicted in Fig. 8C, these amplification products
were subcloned into the multiple cloning site of
pAd.Link.1 (IHGT Vector Core). pAd.Link.1 is an
adenovirus based plasmid containing adenovirus m.u. 9.6
through 16.1. The insertion of the modified PAC regions
into pAd.Link.1 generated two vectors, pAd.PACII
(containing PAC domains I and II) and pAd.PACIV
(containing PAC domains I, II, III, and IV).

Thereafter, as depicted in Figs. 10A and 10B, for
each of these plasmids, a human placenta alkaline
phosphatase reporter minigene containing the immediate
early CMV enhancer/promoter (CMV), human placenta
alkaline phosphatase cDNA (hpAP), and SV40
polyadenylation signal (pA), was subcloned into each PAC
vector, generating pAd.PACII.CMVhpAP and
pAd.PACIV.CMVhpAP, respectively.

These plasmids were then used as substrates for
homologous recombination with dl7001 virus, described
above, by co-transfection into 293 cells. Homologous
recombination occurred between the adenovirus map units
9-16 of the plasmid and the crippled Ad5 virus. The
results of homologous recombination were helper viruses
containing Ad5 5' terminal sequences that contained PAC
domains I and II or PAC domains I, II, III, and IV,
followed by the minigene, and Ad5 3' sequences 9.6-7[...]
and 87-100. Thus, these crippled viruses are deleted of
the E1 gene and the E3 gene.

The plaque formation characteristics of the PAC
helper viruses gave an immediate indication that the PAC
modifications diminished the rate and extent of growth.
Specifically, PAC helper virus plaques did not develop
until day 14-21 post-transfection, and on maturation
remained small. From previous experience, plaques of a
first generation Ad.CBhpAP helper virus with a complete
left terminal sequence would begin to develop by day 7
and mature by day 10.

Viral plaques were picked and suspended in 0.5 ml of
DMEM media. A small aliquot of the virus stock was used
to infect a fresh monolayer of 293 cells and
histochemically stained for recombinant alkaline
phosphatase activity 24 hours post-infection. Six of
eight Ad.PACIV.CMVhpAP (encodes A-repeats I-IV) clones
that were screened for transgene expression were
positive, while all three Ad.PACII.CMVhpAP clones that
were selected scored positive. The clones have been
taken through two rounds of plaque purification and are
currently being expanded to generate a working stock.
These crippled helper viruses are useful in the
production of the AdA virus particles according to the
procedures described in Example 3. They are
characterized by containing sufficient adenovirus genes
to permit the packaging of the shuttle vector genome, but
their crippled PAC sequences reduce their efficiency for
self-encapsidation. Thus fewer helper viruses are
produced in favor of more AdA recombinant viruses.
Purification of AdA virus particles from helper viruses
is facilitated in the CsCl gradient, which is based on
the weight of the respective viral particles. This
facility in purification is a decided advantage of the
AdA vectors of this invention in contrast to adenovirus
vectors having only E1 or smaller deletions. The AdA
vectors even with minigenes of up to about 15 kb are
significantly different in weight than wild type or other
adenovirus helpers containing many adenovirus genes.

Example 9 - AdA Vector Containing a Full-Length
Dystrophin Transgene

Duchenne muscular dystrophy (DMD) is a common
X-linked genetic disease caused by the absence of
dystrophin, a 427K protein encoded by a 14 kilobase
transcript. Lack of this important sarcolemmal protein
leads to progressive muscle wasting, weakness, and death.
One current approach for treating this lethal disease is
to transfer a functional copy of the dystrophin gene into
the affected muscles. For skeletal muscle, a
replication-defective adenovirus represents an efficient
delivery system.

According to the present invention, a recombinant
plasmid pAdA.CMVmdys was created which contains only the
Ad5 cis-elements (i.e., ITRs and contiguous packaging
sequences) and harbors the full-length murine dystrophin
gene driven by the CMV promoter. This plasmid was
generated as follows.

pSL1180 [Pharmacia Biotech] was cut with NotI,
filled in by Klenow, and religated, thus ablating the
NotI site in the plasmid. The resulting plasmid is termed
pSL1180NN and carries a bacterial ori and Amp resistance
gene.

pAdA.CMVLacZ of Example 1 was cut with EcoRI,
Klenowed, and ligated with the ApaI-cut pSL1180NN to form
pAdA.CMV.LacZ (ApaI).

The 14 kb mouse dystrophin cDNA [sequences
provided in C. C. Lee et al, Nature, 349:334-336 (1991)]
was cloned in two large fragments using a lambda ZAP
cloning vector (Stratagene) and subsequently cloned into
the bluescript vector pSK-, giving rise to the plasmid
pCCL-DMD. A schematic diagram of this vector is provided
in Fig. 11, which illustrates the restriction enzyme
sites.

pAdA.CMVLacZ (ApaI) was cut with NotI and the
large fragment gel isolated away from the lacZ cDNA.
pCCL-DMD was also cut with NotI, gel isolated and
subsequently ligated to the large NotI fragment of NotI
digested pAdA.CMVLacZ (ApaI). The sequences of the
resulting vector, pAdA.CMVmdys, are provided in Fig.
12A-12P [SEQ ID NO: 10].

This plasmid contains sequences from the left
end of the Ad5 encompassing bp 1-360 (5' ITR), a mouse
dystrophin minigene under the control of the CMV
promoter, and sequence from the right end of Ad5 spanning
bp 35353 to the end of the genome (3' ITR). The minigene
is followed by an SV-40 poly-A sequence similar to that
described for the plasmids described above.

WO 96/13597 PCT/US95/14017

The vector production system described herein is
employed. Ten 150 mm 293 plates are infected at about 90%
confluency with a reporter recombinant E1-deleted virus
Ad.CBhpAP at an MOI of 5 for 60 minutes at 37°C. These
cells are transfected with pAdA.CMVmDys by calcium
phosphate co-precipitation using 50 µg linearized
DNA/dish for about 12-16 hours at 37°C. Media is
replaced with DMEM + 10% fetal bovine serum.
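
The amount of helper virus needed for an infection at a stated MOI can be estimated as in the following minimal sketch. The cell number per plate is an assumption here: the value of 4 x 10^7 cells per 150 mm dish is borrowed from the seeding density stated in Example 7 and may not match a plate at about 90% confluency. The function name is illustrative only.

```python
def pfu_required(moi, cells_per_plate, plates=1):
    """Plaque forming units needed to infect `plates` plates at a given MOI.

    MOI (multiplicity of infection) is infectious units per cell, so the
    inoculum is MOI multiplied by the total number of cells.
    """
    return moi * cells_per_plate * plates


# ASSUMED cell number per 150 mm dish (seeding density from Example 7).
CELLS_PER_150MM_PLATE = 4e7

total_pfu = pfu_required(moi=5, cells_per_plate=CELLS_PER_150MM_PLATE, plates=10)
print(f"{total_pfu:.1e} pfu of helper virus for ten plates")  # 2.0e+09
```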

Full cytopathic effect is observed and a cell lysate
is made by subjecting the cell pellet to freeze-thaw
procedures three times. The lysate is subjected to an
SW41 three tier CsCl gradient for 2 hours and a band
migrating between the helper adenovirus and incomplete
virus is detected.

Fractions are assayed on a 6 well plate containing
293 cells infected with 5 λ of fraction for 16-20 hours in
DMEM + 2% FBS. Cells are collected, washed with
phosphate buffered saline, and resuspended in 2 ml PBS.
200 λ of the 2 ml cell fractions is cytospun onto a slide.

The cells were subjected to immunofluorescence for
dystrophin as follows. Cells were fixed in 10N MeOH at
-20°C. The cells were exposed to a monoclonal antibody
specific for the carboxy terminus of human dystrophin
[NCL-DYS2; Novocastra Laboratories Ltd., UK]. Cells were
then washed three times and exposed to a secondary
antibody, i.e. 1:200 goat anti-mouse IgG in FITC.

The titer/fraction for seven fractions revealed in
the immunofluorescent stains was calculated by the
following formula and reported in Table 2 below.

DFU/field = (DFU/200 λ cells) x 10 = DFU/10^6 cells =
(DFU/5 λ viral fraction) x 20 = DFU/100 λ fraction.

Table 2

Fraction    DFU/100 λ
1
2           6 x 10^5
3           1.8 x 10^[...]
4           9.6 x 10^5
5           200
6           200
7
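
The scaling in the titer formula preceding Table 2 can be written out as a short calculation. This is a sketch under stated assumptions: λ is read as microliters, one counted field is taken to represent the whole 200 λ cytospin sample, and the function name and the example count of 3 positive cells are hypothetical and are not data from this example.

```python
def dfu_per_100_lambda(dfu_per_field):
    """Scale a count of positive cells per field to DFU per 100 λ of fraction.

    Step 1: 200 λ of the 2 ml cell suspension was cytospun, so multiply
            by 10 to cover all cells in the well.
    Step 2: the well was infected with 5 λ of gradient fraction, so
            multiply by 20 to express the titer per 100 λ of fraction.
    """
    dfu_per_well = dfu_per_field * 10      # 2000 λ / 200 λ
    return dfu_per_well * 20               # 100 λ / 5 λ


# Hypothetical count, for illustration only.
print(dfu_per_100_lambda(3))  # 600
```

The combined factor is 200, so a single positive cell in the counted field corresponds to 200 DFU/100 λ, which matches the smallest nonzero values listed in Table 2.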

A virus capable of transducing the dystrophin
minigene is detected as a "positive" (i.e., green
fluorescent) cell. The results of the IF illustrate that
heat-treated fractions do not show positive
immunofluorescence. Southern blot data suggest one
species of the same size as the input [...] helper
virus contamination.

The recombinant virus can be subsequently separated
from the majority of helper virus by [...]
through cesium gradients. Initial studies [...]
that the functional AdCMVdmDys virions are produced, but
are contaminated with helper virus. [...]
purification would [...]
of encoding viral proteins but are capable of transducing
murine skeletal muscle.

Example 10 - Pseudotyping

The following experiment provides a method for
preparing a recombinant [...] transfection/infection
protocol [...]. It is expected that the ITRs and
packaging sequences of Ad5 could be incorporated into a
virion of another serotype.

A. Protocol

The basic approach is to transfect the
AdA.CMVlacZ recombinant virus (Ad5) into 293 cells and
subsequently infect the cell with the helper virus
derived from a variety of Ad serotypes (2, 3, 4, 5, 7, 8,
12, and 40). When CPE is achieved, the lysate is
harvested and banded through two cesium gradients.

More particularly, the Ad5-based plasmid
pAdA.CMVlacZ of Example 1 was linearized with EcoRI. The
linearized plasmids were then transfected into ten 150 mm
dishes of 293 cells using calcium phosphate
co-precipitation. At 10-15 hours post transfection, wild
type adenoviruses (of one of the following serotypes: 2,
3, 4, 5, 7, 12, 40) were used to infect cells at an MOI
of 5. The cells were then harvested at full CPE and
lysed by three rounds of freeze-thawing. The pellet was
resuspended in 4 mL Tris-HCl. Cell debris was removed by
centrifugation and partial purification of Ad5A.CMVlacZ
from helper virus was achieved with 2 rounds of CsCl
gradient centrifugation (SW41 rotor, 35,000 rpm, 2
hours). Fractions were collected from the bottom of the
tube (fraction #1) and analysed for lacZ transducing
viruses on 293 target cells by histochemical staining (at
20 h PI). Contaminating helper viruses were quantitated
by plaque assay.

Except for adenovirus type 3, infection with Ad
serotypes 2, 4, 5, 7, 12 and 40 was able to produce lacZ
transducing viruses. The peak of β-galactosidase
activity was detected between the two major A260
absorbing peaks, where most of the helper viruses banded
(data not shown). The quantity of lacZ virus recovered
from 10 plates ranged from 10^4 to 10^8 transducing
particles depending on the serotype of the helper. As
expected, Ad2 and Ad5 produced the highest titers of lacZ
transducing viruses (Table 3). Wild type contamination
was in general 10^2-[...] log higher than the
corresponding lacZ titer except in the case of Ad40.

Table 3 summarizes the growth characteristics
of the wild type adenoviruses as evaluated [...]
in 293 cells. This demonstrates the feasibility of
utilizing these helper viruses to infect the cell line
which has been transfected with the Ad5 deleted virus.

Table 3

Ad serotype   P/ml           pfu/ml         p:pfu
2             5 x 10^12      2.5 x 10^11    20:1
3             1 x 10^12      6.25 x 10^9    160:1
4             3 x 10^12      2 x 10^9       150:1
5             1 x 10^12      5 x 10^10      20:1
7a            5 x 10^12      1 x 10^11      50:1
12            6 x 10^11      4 x 10^9       150:1
35            1.2 x 10^12
40            2.2 x 10^12    4.4 x 10^8     5000:1
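
The p:pfu column of Table 3 is the ratio of physical particles to plaque forming units. The following minimal sketch shows the calculation for two rows of the table; the function name is illustrative, and the input values are as read from the table in this text.

```python
def particle_to_pfu_ratio(particles_per_ml, pfu_per_ml):
    """Physical particles per infectious unit (lower means more infectious)."""
    return particles_per_ml / pfu_per_ml


# Values as read from Table 3.
ad2_ratio = round(particle_to_pfu_ratio(5e12, 2.5e11))
ad40_ratio = round(particle_to_pfu_ratio(2.2e12, 4.4e8))
print(f"Ad2  p:pfu = {ad2_ratio}:1")
print(f"Ad40 p:pfu = {ad40_ratio}:1")
```

Running the same calculation on each row is a quick way to check a transcribed table for internal consistency.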






Table 4 summarizes the results of the final
purified fractions. The middle column, labeled LFU/ml,
quantifies the production of lacZ forming units, which is
a direct measure of the packaging and propagation of
pseudotyped recombinant AdA virus. The pfu/ml titer is
an estimate of the contaminating wild type virus.

Table 4

Serotypes   LFU/ml            PFU/ml
2           4.6 x 10^[...]    1.8 x 10^9
3           [...]             NA
4           6.7 x 10^[...]    9.3 x 10^7
5           6.3 x 10^7        1.9 x 10^9
7a          3 x 10^6          1.8 x 10^8
12          1.2 x 10^5        3.3 x 10^8
40          9.5 x 10^[...]    1.5 x 10^3



Tables 5A-5D represent a more detailed analysis
of the fractions from the second purification for each of
the experiments summarized in Table 4. Again, LFU/µl is
the recovery of the AdA viruses, whereas pfu/µl
represents recovery of the helper virus.
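
The totals reported at the foot of each of Tables 5A-5D appear to be the sum over fractions of volume multiplied by titer. The following sketch checks this for the Ad2 fractions, using the values as read from Table 5A in this text; the variable and function names are illustrative, and the interpretation of the total row is an inference from the arithmetic rather than a statement in the source.

```python
# (volume in µl, LFU/µl, PFU/µl) for Ad2 fractions 1-7, as read from Table 5A.
AD2_FRACTIONS = [
    (120, 9532,   8e6),
    (100, 5.8e4,  3e6),
    (100, 8.24e4, 6e5),
    (100, 9.47e4, 1.2e5),
    (100, 6e4,    8e4),
    (100, 2e4,    6e4),
    (100, 5434,   5e4),
]


def gradient_totals(fractions):
    """Return (total LFU, total PFU) summed over all gradient fractions."""
    total_lfu = sum(volume * lfu for volume, lfu, _ in fractions)
    total_pfu = sum(volume * pfu for volume, _, pfu in fractions)
    return total_lfu, total_pfu


total_lfu, total_pfu = gradient_totals(AD2_FRACTIONS)
print(f"LFU total: {total_lfu:.2e}")  # 3.32e+07
print(f"PFU total: {total_pfu:.2e}")  # 1.35e+09
```

Both sums agree with the totals listed for Ad2, which supports reading the per-fraction columns as titers per microliter.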





Table 5A

Ad2 Fraction #     Volume (µl)   LFU/µl         PFU/µl
1                  120           9532           8 x 10^6
2                  100           5.8 x 10^4     3 x 10^6
3                  100           8.24 x 10^4    6 x 10^5
4                  100           9.47 x 10^4    1.2 x 10^5
5                  100           6 x 10^4       8 x 10^4
6                  100           2 x 10^4       6 x 10^4
7                  100           5434           5 x 10^4
Total/10 plates                  3.32 x 10^7    1.35 x 10^9

Table 5B

Ad4 Fraction #     Volume (µl)   LFU/µl         PFU/µl
1                  100           1000           1.75 x 10^5
2                  100           1.79 x 10^4    2.8 x 10^5
3                  100           1.8 x 10^4     5.5 x 10^4
4                  100           2909           1.25 x 10^4
5                  100           920            4 x 10^4
6                  100           153            3 x 10^3
Total/10 plates                  4 x 10^6       5.6 x 10^7

Ad5 Fraction #     Volume (µl)   LFU/µl         PFU/µl
1                  120           1.98 x 10^4    6 x 10^6
2                  100           5.8 x 10^4     3 x 10^6
3                  100           1.2 x 10^5     1.5 x 10^6
4                  100           1 x 10^5       1.4 x 10^5
5                  100           7.96 x 10^4    8 x 10^4
6                  100           6860           6 x 10^4
Total/10 plates                  3.88 x 10^7    1.2 x 10^9

Table 5C

Ad7 Fraction #     Volume (µl)   LFU/µl         PFU/µl
1                  100           1225           5 x 10^[...]
2                  100           5550           4 x 10^5
3                  100           4938           2 x 10^5
4                  100           3866           8 x 10^[...]
5                  100           4134           6 x 10^[...]
6                  100           995            7 x 10^[...]
7                  100           230            6 x 10^[...]
Total/10 plates                  2.09 x 10^6    1.3 x 10^8

Ad12 Fraction #    Volume (µl)   LFU/µl         PFU/µl
1                  100           31             5 x 10^[...]
2                  80            169            8.5 x 10^5
3                  80            245            1.8 x 10^5
4                  110           161            1.1 x 10^5
5                  120           62             7 x 10^3
Total/10 plates                  6.14 x 10^4    1.65 x 10^[...]

Table 5D

Ad40 Fraction #    Volume (µl)   LFU/µl         PFU/µl
1                  80            61             5
2                  80            184            3
3                  80            199            3
4                  80            168            1
5                  80            122
6                  100           46
7                  100           32
Total/10 plates                  6.65 x 10^4    1.1 x 10^[...]
C. [...]

Aliquots of serial fractions were analysed by
Southern blots using lacZ as a probe. In the case of Ad2
and 5, not only the linearized monomer was packaged but
multiple forms of recombinant virus with distinct sizes
were found. These forms correlated well with the sizes
of dimers, trimers and other higher molecular weight
concatamers. The linearized monomers peaked closer to
the top of tube (the defective adenovirus band) than
other forms. When these forms were correlated with lacZ
activity, a better correlation was found with the
higher molecular weight forms than with the monomers.
With pseudotyping of Ad4 and Ad7, no linearized monomers
were packaged and only higher molecular weight forms were
found.

These data definitively demonstrate the
production and characterization of the AdA virus and the
different pseudotypes. This example illustrates a very
simple way of generating pseudotype viruses.
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Example 11 - AdA Vector Containing a FH Gene 

Familial hypercholesterolemia (FH) is an autosomal
dominant disorder caused by abnormalities (deficiencies)
in the function or expression of LDL receptors [M.S.
Brown and J.L. Goldstein, Science, 232(4746):34-37
(1986); J.L. Goldstein and M.S. Brown, "Familial
hypercholesterolemia" in Metabolic Basis of Inherited
Disease, ed. C.R. Scriver et al, McGraw Hill, New York,
pp. 1215-1250 (1989)]. Patients who inherit one abnormal
allele have moderate elevations in plasma LDL and suffer
premature life-threatening coronary artery disease (CAD).
Homozygous patients have severe hypercholesterolemia and
life-threatening CAD in childhood. An FH-containing
vector of the invention is constructed by replacing the
lacZ minigene in the pAdAc.CMVlacZ vector with a minigene
containing the LDL receptor gene [T. Yamamoto et al,
Cell, 39:27-38 (1984)] using known techniques and as
described analogously for the dystrophin gene and CFTR in
the preceding examples. Vectors bearing the LDL receptor
gene can be readily constructed according to this
invention. The resulting plasmid is termed
pAdAc.CMV-LDL.

This plasmid is useful in gene therapy of FH alone,
or preferably, in the form of a conjugate prepared as
described herein to substitute a normal LDL gene for the
abnormal allele responsible for the disease.

A. Ex Vivo Gene Therapy

Ex vivo gene therapy can be performed by
harvesting and establishing a primary culture of
hepatocytes from a patient. Known techniques may be used
to isolate and transduce the hepatocytes with the above
vector(s) bearing the LDL receptor gene(s). For example,
techniques of collagenase perfusion developed for rabbit
liver can be adapted for human tissue and used in
transduction. Following transduction, the hepatocytes
are removed from the tissue culture plates and reinfused
into the patient using known techniques, e.g. via a
catheter placed into the inferior mesenteric vein.

B. In Vivo

Desirably, the in vivo approach to gene
therapy, e.g. liver-directed, involves the use of the
vectors and vector conjugates described above. A
preferred treatment involves infusing a vector LDL
conjugate of this invention into the peripheral
circulation of the patient. The patient is then
evaluated for change in serum lipids and liver tissues.

The virus or conjugate can be used to infect
hepatocytes in vivo by direct injection into a peripheral
or portal vein (10[...]-10[...] pfu/kg) or retrograde
into the biliary tract (same dose). This effects gene
transfer into the majority of hepatocytes.

Treatments are repeated as necessary, e.g.
weekly. Administration of a dose of virus equivalent to
an MOI of approximately 20 (i.e. 20 pfu/hepatocyte) is
anticipated to lead to high level gene expression in the
majority of hepatocytes.

[...] above are [...] by reference. Numerous
modifications and variations of the present invention are
included in the above-identified specification and are
expected to be obvious to one of skill in the art. Such
modifications and alterations to the compositions and
processes of the present invention, such as various
modifications to the PAC sequences or the shuttle
vectors, or to other [...] of the vector, helper virus
and minigene components, are believed to be encompassed
in the scope of the claims appended hereto.
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Trustees of the University of Pennsylvania 

Wilson, James M. 
Fisher, Krishna J. 
Chen, Shu- Jen 
Weitzman, Matthew 

(ii) TITLE OF INVENTION: Improved Adenovirus and Methods 

of Use Thereof 

(iii) NUMBER OF SEQUENCES: 10 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Howson and Howson

(B) STREET: Spring House Corporate Cntr, PO Box 457 

(C) CITY: Spring House 

(D) STATE: Pennsylvania 

(E) COUNTRY: USA 

(F) ZIP: 19477 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/331,381 

(B) FILING DATE: 28-OCT-1994 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Bak, Mary E.

(B) REGISTRATION NUMBER: 31,215 

(C) REFERENCE /DOCKET NUMBER: GNVPN. 008PCT 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 215-540-9200 

(B) TELEFAX: 215-540-5818 
(2) INFORMATION FOR SEQ ID NO:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7897 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: 

GAACTCGAGC AGCTGAAGCT TGAATTCCAT CATCAATAAT ATACCTTATT 50 
TTGGATTGAA GCCAATATGA TAATGAGGGG GTGGAGTTTG TGACGTGGCG 100 
CGGGGCGTGG GAACGGGGCG GGTGACGTAG GTTTTAGGGC GGAGTAACTT 150 
GTATGTGTTG GGAATTGTAG TTTTCTTAAA ATGGGAAGTT ACGTAACGTG 200 
GGAAAACGGA AGTGACGATT TGAGGAAGTT GTGGGTTTTT TGGCTTTCGT 250 
TTCTGGGCGT AGGTTCGCGT GCGGTTTTCT GGGTGTTTTT TGTGGACTTT 300 
AACCGTTACG TCATTTTTTA GTCCTATATA TACTCGCTCT GCACTTGGCC 350 
CTTTTTTACA CTGTGACTGA TTGAGCTGGT GCCGTGTCGA GTGGTGTTTT 400 
TTTAATAGGT TTTCTTTTTT ACTGGTAAGG CTGACTGTTA GGCTGCCGCT 450 
GTGAAGCGCT GTATGTTGTT CTGGAGCGGG AGGGTGCTAT TTTGCCTAGG 500 
CAGGAGGGTT TTTCAGGTGT TTATGTGTTT TTCTCTCCTA TTAATTTTGT 550 
TATACCTCCT ATGGGGGCTG TAATGTTGTC TCTACGCCTG CGGGTATGTA 600 
TTCCCCCCAA GCTTGCATGC CTGCAGGTCG ACTCTAGAGG ATCCGAAAAA 650 
ACCTCCCACA CCTCCCCCTG AACCTGAAAC ATAAAATGAA TGCAATTGTT 700 
GTTGTTAACT TGTTTATTGC AGCTTATAAT GGTTACAAAT AAAGCAATAG 750 
CATCACAAAT TTCACAAATA AAGCATTTTT TTCACTGCAT TCTAGTTGTG 800 
GTTTGTCCAA ACTCATCAAT GTATCTTATC ATGTCTGGAT CCCCGCGGCC 850 
GCCTAGAGTC GAGGCCGAGT TTGTCAGAAA GCAGACCAAA CAGCGGTTGG 900 
AATAATAGCG AGAACAGAGA AATAGCGGCA AAAATAATAC CCGTATCACT 950 
TTTGCTGATA TGGTTGATGT CATGTAGCCA AATCGGGAAA AACGGGAAGT 1000 
AGGCTCCCAT GATAAAAAAG TAAAAGAAAA AGAATAAACC GAACATCCAA 1050 
AAGTTTGTGT TTTTTAAATA GTACATAATG GATTTCCTTA CGCGAAATAC 1100 
GGGCAGACAT GGCCTGCCCG GTTATTATTA TTTTTGACAC CAGACCAACT 1150 
GGTAATGGTA GCGACCGGCG CTCAGCTGTA ATTCCGCCGA TACTGACGGG 1200 
CTCCAGGAGT CGTCGCCACC AATCCCCATA TGGAAACCGT CGATATTCAG 1250 
CCATGTGCCT TCTTCCGCGT GCAGCAGATG GCGATGGCTG CTTTCCATCA 1300 
GTTGCTGTTG ACTGTAGCGG CTGATGTTGA ACTGGAAGTC GCCGCGCCAC 1350 
TGGTGTGGGC CATAATTCAA TTCGCGCGTC CCGCAGCGCA GACCGTTTTC 1400 
GCTCGGGAAG ACGTACGGGG TATACATGTC TGACAATGGC AGATCCCAGC 1450 
GGTCAAAACA GGCGGCAGTA AGGCGGTCGG GATAGTTTTC TTGCGGCCCT 1500 
AATCCGAGCC AGTTTACCCG CTCTGCTACC TGCGCCAGCT GGCAGTTCAG 1550 
GCCAATCCGC GCCGGATGCG GTGTATCGCT CGCCACTTCA ACATCAACGG 1600 
TAATCGCCAT TTGACCACTA CCATCAATCC GGTAGGTTTT CCGGCTGATA 1650 
AATAAGGTTT TCCCCTGATG CTGCCACGCG TGAGCGGTCG TAATCAGCAC 1700 
CGCATCAGCA AGTGTATCTG CCGTGCACTG CAACAACGCT GCTTCGGCCT 1750 
GGTAATGGCC CGCCGCCTTC CAGCGTTCGA CCCAGGCGTT AGGGTCAATG 1800 
CGGGTCGCTT CACTTACGCC AATGTCGTTA TCCAGCGGTG CACGGGTGAA 1850 
CTGATCGCGC AGCGGCGTCA GCAGTTGTTT TTTATCGCCA ATCCACATCT 1900 
GTGAAAGAAA GCCTGACTGG CGGTTAAATT GCCAACGCTT ATTACCCAGC 1950 
TCGATGCAAA AATCCATTTC GCTGGTGGTC AGATGCGGGA TGGCGTGGGA 2000 
CGCGGCGGGG AGCGTCACAC TGAGGTTTTC CGCCAGACGC CACTGCTGCC 2050 
AGGCGCTGAT GTGCCCGGCT TCTGACCATG CGGTCGCGTT CGGTTGCACT 2100 
ACGCGTACTG TGAGCCAGAG TTGCCCGGCG CTCTCCGGCT GCGGTAGTTC 2150 
AGGCAGTTCA ATCAACTGTT TACCTTGTGG AGCGACATCC AGAGGCACTT 2200 
CACCGCTTGC CAGCGGCTTA CCATCCAGCG CCACCATCCA GTGCAGGAGC 2250 
TCGTTATCGC TATGACGGAA CAGGTATTCG CTGGTCACTT CGATGGTTTG 2300 
CCCGGATAAA 
GCGCTGGATG 
TGGCGATCGT 
GTTGCCGTTT 
CGAAGCCGCC 
GCGAAACCGC 
CAGCGGGCGC 
TCGGCACAGC 
CAAATAATAT 
CGGGCGGGAA 
TAGCGCCGTG 
TGATTACGAT 
TAGCCAGCGC 
TTTCAATATT 
GTGTACCACA 
GTTGTTCTGC 
TGACCTGACC 
AGCAACGGCT 
GCGGAAACCG 
TGTGCAGTTC 
AGTTTCGGGT 
ACCACGCTCA 
CCTGCGTTTC 
AACTCGCCGC 
ATCATTAAAG 
TATGCAGCAA 



CGGAACTGGA 
CGGCGTGCGG 
TCGGCGTATC 
TCATCATATT 
CTGTAAACGG 
CAAGACTGTT 
GTCTCTCCAG 
CGGGAAGGGC 
CGGTGGCCGT 
GGATCGACAG 
GCCTGATTCA 
CGCGCTGCAC 
GGATCATCGG 
GGCTTCATCC 
GCGGATGGTT 
TTCATCAGCA 
ATGCAGAGGA 
TGCCGTTCAG 
ACATCGCAGG 
AACCACCGCA 
TTTCGACGTT 
TCGATAATTT 
ACCCTGCCAT 
ACATCTGAAC 
CGAGTGGCAA 
CGAGACGTCA 



AAAACTGCTG 
TCGGCAAAGA 
GCCAAAATCA 
TAATCAGCGA 
GGATACTGAC 
ACCCATCGCG 
GTAGCGAAAG 
TGGTCTTCAT 
GGTGTCGGCT 
ATTTGATCCA 
TTCCCCAGCG 
CATTCGCGTT 
TCAGACGATT 
ACCACATACA 
CGGATAATGC 
GGATATCCTG 
TGATGCTCGT 
CAGCAGCAGA 
CTTCTGCTTC 
CGATAGAGAT 
CAGACGTAGT 
CACCGCCGAA 
AAAGAAACTG 
TTCAGCCTCC 
CATGGAAATC 
CGGAAAATGC 



CTGGTGTTTT 



CCAGACCGTT 
CCGCCGTAAG 
CTGATCCACC 
GAAACGCCTG 
TGGGCGTATT 
CCATTTTTTG 
CCACGCGCGC 
CCGCCGCCTT 
GCGATACAGC 
ACCAGATGAT 
ACGCGTTCGC 
CATTGGCACC 
GGCCGTAGCG 
GAACAGCGCA 
CACCATCGTC 
GACGGTTAAC 
CCATTTTCAA 
AATCAGCGTG 
TCGGGATTTC 
GTGACGCGAT 
AGGCGCGGTG 
TTACCCGTAG 
AGTACAGCGC 



GCTGATTTGT 
CGCTCATCCG 



GCTTCCGTCA 
CATACAGAAC 
CCGACCACGG 
CAGTCCCAGA 
CCAGTATTTA 
CGCAAAGGAT 
ATGGACCATT 
GTACATCGGG 
CATACTGCAC 
GCGTCGTGAT 
CACACTCGGG 
TCATCGCCGG 
ATGCCGTGGG 
GTCGCACAGC 
CGGCGTTAAA 
TGCTCATCCA 
GCCTCGAATC 
TCCGCACCTC 
CCGTCGGCGG 
GGCGCTCCAC 
CGGCATAACC 
CCGCTGGCGA 
GTAGTCACGC 
GGCTGAAATC 
GTAGTCGGTT 
CCACATATCC 



2350 
2400 
2450 
2500 
2550 
2600 
2650 
2700 
2750 
2800 
2850 
2900 
2950 
3000 
3050 
3100 
3150 
3200 
3250 
3300 
3350 
3400 
3450 
3500 
3550 
3600 
TGATCTTCCA GATAACTGCC GTCACTCCAA 
GCGGTTTTCT CCGGCGCGTA AAAATGCGCT 
AACGACTGTC CTGGCCGTAA CCGACCCAGC 
AACGCCGAGT TAACGCCATC AAAAATAATT 
CCAGCTTTCA TCAACATTAA ATGTGAGCGA 
CCGTGGGAAC AAACGGCGGA TTGACCGTAA 
TAGATGGGCG CATCGTAACC GTGCATCTGC 
AGTATCGGCC TCAGGAAGAT CGCACTCCAG 
CTGGTGCCGG AAACCAGGCA AAGCGCCATT 
TGTTGGGAAG GGCGATCGGT GCGGGCCTCT 
CAAAGGGGGA TGTGCTGCAA GGCGATTAAG 
CCCAGTCACG ACGTTGTAAA ACGACGGGAT 
CTGGTGTCCA GACCAATGCC TCCCAGACCG 
GTTGGTCAAA GTAAACGACA TGGTGACTTC 
TTTCGATCCC CGGGAATTGC GGCCGCGGGT 
GCAGAAGTAA CACTTCCGTA CAGGCCTAGA 
AGGAGCAGTT CTTTGATTTG CACCACCACC 
AAGACAAAAA GACTAAACTT ACCAGTTAAC 
CGAGTACCGG ATCCTCTAGA GTCCGGAGGC 
CTATGGAGGT CAAAACAGCG TGGATGGCGT 
CACTAAACGA GCTCTGCTTA TATAGACCTC 
CCATTTGCGT CAATGGGGCG GAGTTGTTAC 
TGATTTTGGT GCCAAAACAA ACTCCCATTG 
TGGAAATCCC CGTGAGTCAA ACCGCTATCC 
AAAACCGCAT CACCATGGTA ATAGCGATGA 
CAAGTAGGAA AGTCCCATAA GGTCATGTAC 
CGCAGCACCA 


TCACCGCGAG 


3650 


CAGGTCAAAT 


TCAGACGGCA 


3700 


GCCCGTTGCA 


CCACAGATGA 


3750 


CGCGTCTGGC 


CTTCCTGTAG 


3800 


GTAACAACCC 


GTCGGATTCT 


3850 


TGGGATAGGT 


TACGTTGGTG 


3900 


CAGTTTGAGG 


GGACGACGAC 


3950 


CCAGCTTTCC 


GGCACCGCTT 


4000 


CGCCATTCAG 


GCTGCGCAAC 


4050 


TCGCTATTAC 


GCCAGCTGGC 


4100 


TTGGGTAACG 


CCAGGGTTTT 


4150 


CGCGCTTGAG 


CAGCTCCTTG 


4200 


GCAACGAAAA 


TCACGTTCTT 


4250 


TTTTTTGCTT 


TAGCAGGCTC 


4300 


ACAATTCCGC 


AGCTTTTAGA 


4350 


AGTAAAGGCA 


ACATCCACTG 


4400 


GGATCCGGGA 


CCTGAAATAA 


4450 


TTTCTGGTTT 


TTCAGTTCCT 


4500 




CCGGTCTCTT 


4550 



CTCCAGGCGA 


TCTGACGGTT 


4600 


CCACCGTACA 


CGCCTACCGC 


4650 


GACATTTTGG 


AAAGTCCCGT 


4700 


ACGTCAATGG 


GGTGGAGACT 


4750 


ACGCCCATTG 


ATGTACTGCC 


4800 


CTAATACGTA 


GATGTACTGC 


4850 


TGGGCATAAT 


GCCAGGCGGG 


4900 



WO 96/13597 PCT/US95/14017 



CCATTTACCG 
CTTGATGTAC 
CGTCAATGGA 
TGACGTCAAT 
CCGTAAGTTA 
CAAATATTAC 
CTTATTCAGT 
AATTTACTAC 
GTACACCGGC 
ATGTGTTCCG 
ACCCGCCCCG 
TTATCATATT 
CGAATTCATC 
TTAATTTCGA 
GGGGAGAGGC 
ACTCGCTGCG 
AAGGCGGTAA 
CATGTGAGCA 
TGCTGGCGTT 
CGACGCTCAA 
GGCGTTTCCC 
CGCTTACCGG 
TCTCAATGCT 
CAAGCTGGGC 
TATCCGGTAA 
CCACTGGCAG 



TCATTGACGT 
TGCCAAGTGG 
AAGTCCCTAT 
GGGCGGGGGT 
TGTAACGACC 
GCGCTATGAG 
TTTCCCGCGA 
AACATCCGCC 
GCACACCAAA 
CCACACTTGC 
TTCCCACGCC 
GGCTTCAATC 
GATGATATCA 
TAAGCCAGGT 
GGTTTGCGTA 
CTCGGTCGTT 
TACGGTTATC 
AAAGGCCAGC 
TTTCCATAGG 
GTCAGAGGTG 
CCTGGAAGCT 
ATACCTGTCC 
CACGCTGTAG 
TGTGTGCACG 
CTATCGTCTT 
CAGCCACTGG 



CAATAGGGGG 
GCAGTTTACC 
TGGCGTTACT 
CGTTGGGCGG 
TGCAGGTCGA 
TAACACAAAA 
AAATGGCCAA 
TAAAACCGCG 
AACGTCACTT 
AACATCACAC 
CCGCGCCACG 
CAAAATAAGG 
GATCTGCCGG 
TAACCTGCAT 
TTGGGCGCTC 
CGGCTGCGGC 
CACAGAATCA 
AAAAGGCCAG 
CTCCGCCCCC 
GCGAAACCCG 
CCCTCGTGCG 
GCCTTTCTCC 
GTATCTCAGT 
AACCCCCCGT 
GAGTCCAACC 
TAACAGGATT 



CGTACTTGGC 
GTAAATACTC 
ATGGGAACAT 



TCAGCCAGGC 
CTCTAGAGGA 
TTATTCAGAT 
ATCTTACTCG 
CGAAAATTGT 
TTGCCACATC 
TTCCGCCACA 
TCACAAACTC 
TATATTATTG 
TCTCCCTATA 
TAATGAATCG 
TTCCGCTTCC 
GAGCGGTATC 
GGGGATAACG 
GAACCGTAAA 
CTGACGAGCA 
ACAGGACTAT 
CTCTCCTGTT 
CTTCGGGAAG 
TCGGTGTAGG 
TCAGCCCGAC 
CGGTAAGACA 
AGCAGAGCGA 



ATATGATACA 
CACCCATTGA 
ACGTCATTAT 
GGGCCATTTA 
TCTCCCTAGA 
TTCACTTCCT 
GTTACGCCCA 
CACTTCCTGT 
CGTCGCTTAC 
CTACTACGTC 
CACCCCCTCA 
ATGATGCTAG 
GTGAGTCGTA 
GCCAACGCGC 
TCGCTCACTG 
AGCTCACTCA 
CAGGAAAGAA 
AAGGCCGCGT 
TCACAAAAAT 
AAAGATACCA 
CCGACCCTGC 
CGTGGCGCTT 
TCGTTCGCTC 
CGCTGCGCCT 
CGACTTATCG 
GGTATGTAGG 



4950 
5000 
5050 
5100 
5150 
5200 
5250 
5300 
5350 
5400 
5450 
5500 
5550 
5600 
5650 
5700 
5750 
5800 
5850 
5900 
5950 
6000 
6050 
6100 
6150 
6200 
CGGTGCTACA GAGTTCTTGA AGTGGTGGCC 
GGACAGTATT TGGTATCTGC GCTCTGCTGA 
AGAGTTGGTA GCTCTTGATC CGGCAAACAA 
TTTTTTTGTT TGCAAGCAGC AGATTACGCG 
AAGATCCTTT GATCTTTTCT ACGGGGTCTG 
TCACGTTAAG GGATTTTGGT CATGAGATTA 
GATCCTTTTA AATTAAAAAT GAAGTTTTAA 
AGTAAACTTG GTCTGACAGT TACCAATGCT 
TCAGCGATCT GTCTATTTCG TTCATCCATA 
GTAGATAACT ACGATACGGG AGGGCTTACC 
TGATACCGCG AGACCCACGC TCACCGGCTC 
CAGCCAGCCG GAAGGGCCGA GCGCAGAAGT 
CTCCATCCAG TCTATTAATT GTTGCCGGGA 
CAGTTAATAG TTTGCGCAAC GTTGTTGCCA 

TCACGCTCGT CGTTTGGTAT GGCTTCATTC 
AAGGCGAGTT ACATGATCCC CCATGTTGTG 
TCGGTCCTCC GATCGTTGTC AGAAGTAAGT 
ATGGTTATGG CAGCACTGCA TAATTCTCTT 
ATGCTTTTCT GTGACTGGTG AGTACTCAAC 
GTATGCGGCG ACCGAGTTGC TCTTGCCCGG 
GCGCCACATA GCAGAACTTT AAAAGTGCTC 
GGGGCGAAAA CTCTCAAGGA TCTTACCGCT 
AACCCACTCG TGCACCCAAC TGATCTTCAG 
GTTTCTGGGT GAGCAAAAAC AGGAAGGCAA 
AAGGGCGACA CGGAAATGTT GAATACTCAT 
ATTGAAGCAT TTATCAGGGT TATTGTCTCA 
TAACTACGGC 


TACACTAGAA 


6250 


AGCCAGTTAC 


CTTCGGAAAA 


6300 


ACCACCGCTG 


CTAGCGGTGG 


6350 


CAGAAAAAAA 


GGATCTCAAG 


6400 


ACGCTCAGTG 


GAACGAAAAC 


6450 


TCAAAAAGGA 


TCTTCACCTA 


6500 


ATCAATCTAA 


AGTATATATG 


6550 


TAATCAGTGA 


GGCACCTATC 


6600 


GTTGCCTGAC 


TCCCCGTCGT 


6650 


ATCTGGCCCC 


AGTGCTGCAA 


6700 


CAGATTTATC 


AGCAATAAAC 


6750 


GGTCCTGCAA 


CTTTATCCGC 


6800 


AGCTAGAGTA AGTAGTTCGC 


6850 


TTGCTACAGG 


CATCGTGGTG 


6900 


AGCTCCGGTT 


CCCAACGATC 


6950 


CAAAAAAGCG 


GTTAGCTCCT 


7000 


TGGCCGCAGT 


GTTATCACTC 


7050 


ACTGTCATGC 


CATCCGTAAG 


7100 


CAAGTCATTC 


TGAGAATAGT 


7150 


CGTCAATACG 


GGATAATACC 


7200 


ATCATTGGAA 


AACGTTCTTC 


7250 


GTTGAGATCC 


AGTTCGATGT 


7300 


CATCTTTTAC 


TTTCACCAGC 


7350 


AATGCCGCAA 


AAAAGGGAAT 


7400 


ACTCTTCCTT 


TTTCAATATT 


7450 


TGAGCGGATA 


CATATTTGAA 


7500 
TGTATTTAGA 
AGTGCCACCT 
AAAATAGGCG 
GGTGAAAACC 
GTAAGCGGAT 
TTGGCGGGTG 
CTGAGAGTGC 
ATACATAACC 



AAAATAAACA 
GACGTCTAAG 
TATCACGAGG 
TCTGACACAT 
GCCGGGAGCA 
TCGGGGCTGG 
ACCATATGGA 
TTATGTATCA 




AATAGGGGTT 
AAACCATTAT 
CCCTTTCGTC 



GCAGCTCCCG 
GACAAGCCCG 
CTTAACTATG 
CATATTGTCG 



TACACATACG 



CCGCGCACAT 
TATCATGACA 
TCGCGCGTTT 
GAGACGGTCA 
TCAGGGCGCG 
CGGCATCAGA 
TTAGAACGCG 
ATTTAGGTGA 



TTCCCCGAAA 
TTAACCTATA 
CGGTGATGAC 
CAGCTTGTCT 
TCAGCGGGTG 
GCAGATTGTA 
GCTACAATTA 

CACTATA 



7550 

7600 

7650 

7700 

7750 

7800 

7850 

7897 
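
The listing above follows a simple layout: the sequence is printed in blocks of ten bases, five blocks per row, and the number attached to each row is the running base count at the end of that row. A minimal sketch of a consistency check for that layout is given below, assuming each row has been reassembled as space-separated blocks followed by its count. The function name `parse_listing` and the two-row sample (the first 100 bases of SEQ ID NO:1) are illustrative choices, not part of the listing.

```python
VALID_BASES = set("ACGT")


def parse_listing(rows):
    """Join the blocks of each row and verify the running base count.

    rows -- iterable of strings such as "GAACTCGAGC AGCTGAAGCT ... 50",
            i.e. whitespace-separated blocks followed by the cumulative
            number of bases at the end of the row.
    Returns the concatenated sequence, or raises ValueError on a mismatch.
    """
    sequence = ""
    for line_number, row in enumerate(rows, start=1):
        *blocks, count = row.split()
        chunk = "".join(blocks)
        unexpected = set(chunk) - VALID_BASES
        if unexpected:
            raise ValueError(
                f"row {line_number}: unexpected characters {sorted(unexpected)}"
            )
        sequence += chunk
        if len(sequence) != int(count):
            raise ValueError(
                f"row {line_number}: have {len(sequence)} bases, row says {count}"
            )
    return sequence


ROWS = [
    "GAACTCGAGC AGCTGAAGCT TGAATTCCAT CATCAATAAT ATACCTTATT 50",
    "TTGGATTGAA GCCAATATGA TAATGAGGGG GTGGAGTTTG TGACGTGGCG 100",
]

print(len(parse_listing(ROWS)))  # prints 100
```

A check of this kind flags rows in which a block was dropped, split, or corrupted with a non-base character, which is the most common damage in a scanned listing.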



(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7852 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

GAATTCGCTA GCTAGCGGGG GAATACATAC CCGCAGGCGT 

TTACAGCCCC CATAGGAGGT ATAACAAAAT TAATAGGAGA 

TAAACACCTG AAAAACCCTC CTGCCTAGGC AAAATAGCAC 

CAGAACAACA TACAGCGCTT CACAGCGGCA GCCTAACAGT 

AGTAAAAAAG AAAACCTATT AAAAAAACAC CACTCGACAC 

CAATCAGTCA CAGTGTAAAA AAGGGCCAAG TGCAGAGCGA 

GACTAAAAAA TGACGTAACG GTTAAAGTCC ACAAAAAACA 

CGCACGCGAA CCTACGCCCA GAAACGAAAG CCAAAAAACC 

TCAAATCGTC ACTTCCGTTT TCCCACGTTA CGTAACTTCC 

AAACTACAAT TCCCAACACA TACAAGTTAC TCCGCCCTAA 

ACCCGCCCCG TTCCCACGCC CCGCGCCACG TCACAAACTC 



AGAGACAACA 
GAAAAACACA 
CCTCCCGCTC 
CAGCCTTACC 
GGCACCAGCT 
GTATATATAG 
CCCAGAAAAC 
CACAACTTCC 
CATTTTAAGA 
AACCTACGTC 
CACCCCCTCA 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
TTATCATATT GGCTTCAATC CAAAATAAGG TATATTATTG ATGATGCTAG 600 
CATCATCAAT AATATACCTT ATTTTGGATT GAAGCCAATA TGATAATGAG 650 
GGGGTGGAGT TTGTGACGTG GCGCGGGGCG TGGGAACGGG GCGGGTGACG 700 
TAGTAGTGTG GCGGAAGTGT GATGTTGCAA GTGTGGCGGA ACACATGTAA 750 
GCGACGGATG TGGCAAAAGT GACGTTTTTG GTGTGCGCCG GTGTACACAG 800 
GAAGTGACAA TTTTCGCGCG GTTTTAGGCG GATGTTGTAG TAAATTTGGG 850 
CGTAACCGAG TAAGATTTGG CCATTTTCGC GGGAAAACTG AATAAGAGGA 900 
AGTGAAATCT GAATAATTTT GTGTTACTCA TAGCGCGTAA TATTTGTCTA 950 
GGGAGATCAG CCTGCAGGTC GTTACATAAC TTACGGTAAA TGGCCCGCCT 1000 
GGCTGACCGC CCAACGACCC CCGCCCATTG ACGTCAATAA TGACGTATGT 1050 
TCCCATAGTA ACGCCAATAG GGACTTTCCA TTGACGTCAA TGGGTGGAGT 1100 
ATTTACGGTA AACTGCCCAC TTGGCAGTAC ATCAAGTGTA TCATATGCCA 1150 
AGTACGCCCC CTATTGACGT CAATGACGGT AAATGGCCCG CCTGGCATTA 1200 
TGCCCAGTAC ATGACCTTAT GGGACTTTCC TACTTGGCAG TACATCTACG 1250 
TATTAGTCAT CGCTATTACC ATGGTGATGC GGTTTTGGCA GTACATCAAT 1300 
GGGCGTGGAT AGCGGTTTGA CTCACGGGGA TTTCCAAGTC TCCACCCCAT 1350 
TGACGTCAAT GGGAGTTTGT TTTGGCACCA AAATCAACGG GACTTTCCAA 1400 
AATGTCGTAA CAACTCCGCC CCATTGACGC AAATGGGCGG TAGGCGTGTA 1450 
CGGTGGGAGG TCTATATAAG CAGAGCTCGT TTAGTGAACC GTCAGATCGC 1500 
CTGGAGACGC CATCCACGCT GTTTTGACCT CCATAGAAGA CACCGGGACC 1550 
GATCCAGCCT CCGGACTCTA GAGGATCCGG TACTCGAGGA ACTGAAAAAC 1600 
CAGAAAGTTA ACTGGTAAGT TTAGTCTTTT TGTCTTTTAT TTCAGGTCCC 1650 
GGATCCGGTG GTGGTGCAAA TCAAAGAACT GCTCCTCAGT GGATGTTGCC 1700 
TTTACTTCTA GGCCTGTACG GAAGTGTTAC TTCTGCTCTA AAAGCTGCGG 1750 
AATTGTACCC GCGGCCGCAA TTCCCGGGGA TCGAAAGAGC CTGCTAAAGC 1800 
AAAAAAGAAG TCACCATGTC GTTTACTTTG ACCAACAAGA ACGTGATTTT 1850 
CGTTGCCGGT 
AGCGCGATCC 
ACCCAACTTA 
TAGCGAAGAG 
ATGGCGAATG 
AGCTGGCTGG 
AAACTGGCAG 
ATCCCATTAC 
TGTTACTCGC 
GACGCGAATT 
ACGGGCGCTG 
GACCTGAGCG 
GCTGCGTTGG 
TGAGCGGCAT 
ATCAGCGATT 
TGTACTGGAG 
GGGTAACAGT 
GCGCCTTTCG 
CGTCACACTA 
TCCCGAATCT 
CTGATTGAAG 
AAATGGTCTG 
ACCGTCACGA 



CTGGGAGGCA 



CGTCGTTTTA 
ATCGCCTTGC 
GCCCGCACCG 
GCGCTTTGCC 
AGTGCGATCT 
ATGCACGGTT 
GGTCAATCCG 
TCACATTTAA 
ATTTTTGATG 
GGTCGGTTAC 



CATTTTTACG 



TTGGTCTGGA 
CAACGTCGTG 
AGCACATCCC 
ATCGCCCTTC 
TGGTTTCCGG 
TCCTGAGGCC 
ACGATGCGCC 
CCGTTTGTTC 
TGTTGATGAA 
GCGTTAACTC 
GGCCAGGACA 
CGCCGGAGAA 



CACCAGCAAG 



ACTGGGAAAA 
CCTTTCGCCA 



AGTGACGGCA GTTATCTGGA 

GTCTCGTTGC 



ATGGTGCAGG 
CTGTTCGCAT 
ACGGCCTGTA 



TTTCCGTGAC 
TCCATGTTGC 
GCTGAAGTTC 
TTCTTTATGG 
GCGGTGAAAT 
CGTCTGAACG 
CTATCGTGCG 
CAGAAGCCTG 
CTGCTGCTGA 
GCATCATCCT 
ATATCCTGCT 
TATCCGAACC 
TGTGGTGGAT 



CACTCGCTTT 
AGATGTGCGG 
CAGGGTGAAA 
TATCGATGAG 
TCGAAAACCC 
GTGGTTGAAC 
CGATGTCGGT 
ACGGCAAGCC 
CTGCATGGTC 
GATGAAGCAG 
ATCCGCTGTG 
GAAGCCAATA 



CCAACAGTTG 
CACCAGAAGC 
GATACTGTCG 
CATCTACACC 
CCACGGAGAA 
AGCTGGCTAC 
GGCGTTTCAT 
GTCGTTTGCC 
AACCGCCTCG 
AGATCAGGAT 
TGCATAAACC 
AATGATGATT 
CGAGTTGCGT 
CGCAGGTCGC 
CGTGGTGGTT 
GAAACTGTGG 



TGCACACCGC 
TTCCGCGAGG 
GTTGCTGATT 
AGGTCATGGA 
AACAACTTTA 
GTACACGCTG 
TTGAAACCCA 



GAGCTGCTCA 
CCCTGGCGTT 
GCTGGCGTAA 
CGCAGCCTGA 
GGTGCCGGAA 
TCGTCCCCTC 
AACGTAACCT 
TCCGACGGGT 
AGGAAGGCCA 
CTCTGGTGCA 
GTCTGAATTT 
CGGTGATGGT 
ATGTGGCGGA 
GACTACACAA 
TCAGCCGCGC 
GACTACCTAC 
CAGCGGCACC 
ATGCCGATCG 
AGCGCCGAAA 
CGACGGCACG 
TGCGGATTGA 
CGAGGCGTTA 
TGAGCAGACC 
ACGCCGTGCG 
TGCGACCGCT 
CGGCATGGTG 



1900 
1950 
2000 
2050 
2100 
2150 
2200 
2250 
2300 
2350 
2400 
2450 
2500 
2550 
2600 
2650 
2700 
2750 
2800 
2850 
2900 
2950 
3000 
3050 
3100 
3150 
CCAATGAATC GTCTGACCGA TGATCCGCGC TGGCTACCGG CGATGAGCGA 3200 
ACGCGTAACG CGAATGGTGC AGCGCGATCG TAATCACCCG AGTGTGATCA 3250 
TCTGCTCGCT GGGGAATGAA TCAGGCCACG GCGCTAATCA CGACGCGCTG 3300 
TATCGCTGGA TCAAATCTGT CGATCCTTCC CGCCCGGTGC AGTATGAAGG 3350 
CGGCGGAGCC GACACCACGG CCACCGATAT TATTTGCCCG ATGTACGCGC 3400 
GCGTGGATGA AGACCAGCCC TTCCCGGCTG TGCCGAAATG GTCCATCAAA 3450 
AAATGGCTTT CGCTACCTGG AGAGACGCGC CCGCTGATCC TTTGCGAATA 3500 
CGCCCACGCG ATGGGTAACA GTCTTGGCGG TTTCGCTAAA TACTGGCAGG 3550 
CGTTTCGTCA GTATCCCCGT TTACAGGGCG GCTTCGTCTG GGACTGGGTG 3600 
GATCAGTCGC TGATTAAATA TGATGAAAAC GGCAACCCGT GGTCGGCTTA 3650 
CGGCGGTGAT TTTGGCGATA CGCCGAACGA TCGCCAGTTC TGTATGAACG 3700 
GTCTGGTCTT TGCCGACCGC ACGCCGCATC CAGCGCTGAC GGAAGCAAAA 3750 
CACCAGCAGC AGTTTTTCCA GTTCCGTTTA TCCGGGCAAA CCATCGAAGT 3800 
GACCAGCGAA TACCTGTTCC GTCATAGCGA TAACGAGCTC CTGCACTGGA 3850 
TGGTGGCGCT GGATGGTAAG CCGCTGGCAA GCGGTGAAGT GCCTCTGGAT 3900 
GTCGCTCCAC AAGGTAAACA GTTGATTGAA CTGCCTGAAC TACCGCAGCC 3950 
GGAGAGCGCC GGGCAACTCT GGCTCACAGT ACGCGTAGTG CAACCGAACG 4000 
CGACCGCATG GTCAGAAGCC GGGCACATCA GCGCCTGGCA GCAGTGGCGT 4050 
CTGGCGGAAA ACCTCAGTGT GACGCTCCCC GCCGCGTCCC ACGCCATCCC 4100 
GCATCTGACC ACCAGCGAAA TGGATTTTTG CATCGAGCTG GGTAATAAGC 4150 
GTTGGCAATT TAACCGCCAG TCAGGCTTTC TTTCACAGAT GTGGATTGGC 4200 
GATAAAAAAC AACTGCTGAC GCCGCTGCGC GATCAGTTCA CCCGTGCACC 4250 
GCTGGATAAC GACATTGGCG TAAGTGAAGC GACCCGCATT GACCCTAACG 4300 
CCTGGGTCGA ACGCTGGAAG GCGGCGGGCC ATTACCAGGC CGAAGCAGCG 4350 
TTGTTGCAGT GCACGGCAGA TACACTTGCT GATGCGGTGC TGATTACGAC 4400 
CGCTCACGCG TGGCAGCATC AGGGGAAAAC CTTATTTATC AGCCGGAAAA 4450 
CCTACCGGAT TGATGGTAGT GGTCAAATGG CGATTACCGT 
GTGGCGAGCG ATACACCGCA TCCGGCGCGG ATTGGCCTGA 
GGCGCAGGTA GCAGAGCGGG TAAACTGGCT CGGATTAGGG 
ACTATCCCGA CCGCCTTACT GCCGCCTGTT TTGACCGCTG 



TTGTCAGACA 
CTGCGGGACG 
TCCAGTTCAA 
CATCGCCATC 
TTTCCATATG 



TGTATACCCC 
CGCGAATTGA 
CATCAGCCGC 
TGCTGCACGC 
GGGATTGGTG 



GTACGTCTTC 

ATTATGGCCC 

TACAGTCAAC 

GGAAGAAGGC 

GCGACGACTC 



CGGAATTACA GCTGAGCGCC GGTCGCTACC 



CAAAAATAAT 
GAAATCCATT 



AATAACCGGG 
ATGTACTATT 



CCGAGCGAAA 
ACACCAGTGG 
AGCAACTGAT 
ACATGGCTGA 
CTGGAGCCCG 
ATTACCAGTT 
CTGCCCGTAT 
AAACTTTTGG 



TATTCTTTTT CTTTTACTTT 
CCGATTTGGC TACATGACAT 
TATTTTTGCC GCTATTTCTC 



GTCTGCTTTC 
AGACATGATA 
AGTGAAAAAA 
GTAACCATTA 
TTTTATGTTT 
TAGAGTCGAC 
GCTTCCGGCG 
GGTAGATGAC 
CCAGCCTAAC 
GCCTCGGCGA 
ATACCTTGTC 



CAGGCCATGT 
TAAAAAACAC 
TTTATCATGG 

CAACCATATC AGCAAAAGTG 
TGTTCTCGCT ATTATTCCAA 



TGATGTTGAA 
ACTGCCAGCT 
CCGCAAGAAA 
GGATCTGCCA 
ACGGTCTGCG 
CGCGGCGACT 
GGAAACCAGC 
ATATCGACGG 
TCAGTATCGG 
GGTCTGGTGT 
TTCGCGTAAG 
ATGTTCGGTT 



TGACAAACTC 
AGATACATTG 
ATGCTTTATT 
TAAGCTGCAA 
CAGGTTCAGG 
GACGCGAGGC 
GCATCGGGAT 



GACCATCAGG 
TTCGATCACT 
GCACATGGAA 
TGCCTCCCCG 



GAGCCTACTT CCCGTTTTTC 

ATACGGGTAT 
CCGCTGTTTG 
GCGGGGATCC 
ACTAGAATGC 
TGCTTTATTT 
ATTGCATTCA 
TCGGATCCTC 
GATTCTTCTC 
TGTCCAGGCA 
GCGGCTCTTA 



GGCCTCGACT CTAGGCGGCC 
ATGAGTTTGG ACAAACCACA 
TGTGAAATTT GTGATGCTAT 
TAAACAAGTT AACAACAACA 
GGGAGGTGTG GGAGGTTTTT 
TGGATGGCCT TCCCCATTAT 
GCCCGCGTTG CAGGCCATGC 



GACAGCTTCA AGGATCGCTC 
GGACCGCTGA TCGTCACGGC 
CGGGTTGGCA TGGATTGTAG 
CGTTGCGTCG CGGTGCATGG 



GATTTATGCC 
GCGCCGCCCT 
AGCCGGGCCA 



4500 

4550 

4600 

4650 

4700 

4750 

4800 

4850 

4900 

4950 

5000 

5050 

5100 

5150 

5200 

5250 

5300 

5350 

5400 

5450 

5500 

5550 

5600 

5650 

5700 

5750 
CCTCGACCTG AATGGAAGCC GGCGGCACCT CGCTAACGGA TTCACCACTC 5800 
CAAGAATTGG AGCCAATCAA TTCTTGCGGA GAACTGTGAA TGCGCAAACC 5850 
AACCCTTGGC AGAACATATC CATCGCGTCC GCCATCTCCA GCAGCCGCAC 5900 
GCGGCGCATC TCGGGCAGCG TTGGGTCCTG GCCACGGGTG CGCATGATCG 5950 
TGCTCCTGTC GTTGAGGACC CGGCTAGGCT GGCGGGGTTG CCTTACTGGT 6000 
TAGCAGAATG AATCACCGAT ACGCGAGCGA ACGTGAAGCG ACTGCTGCTG 6050 
CAAAACGTCT GCGACCTGAG CAACAACATG AATGGTCTTC GGTTTCCGTG 6100 
TTTCGTAAAG TCTGGAAACG CGGAAGTCAG CGCCCTGCAC CATTATGTTC 6150 
CGGATCTGCA TCGCAGGATG CTGCTGGCTA CCCTGTGGAA CACCTACATC 6200 
TGTATTAACG AAGCCTTTCT CAATGCTCAC GCTGTAGGTA TCTCAGTTCG 6250 
GTGTAGGTCG TTCGCTCCAA GCTGGGCTGT GTGCACGAAC CCCCCGTTCA 6300 
GCCCGACCGC TGCGCCTTAT CCGGTAACTA TCGTCTTGAG TCCAACCCGG 6350 
TAAGACACGA CTTATCGCCA CTGGCAGCAG CCACTGGTAA CAGGATTAGC 6400 
AGAGCGAGGT ATGTAGGCGG TGCTACAGAG TTCTTGAAGT GGTGGCCTAA 6450 
CTACGGCTAC ACTAGAAGGA CAGTATTTGG TATCTGCGCT CTGCTGAAGC 6500 
CAGTTACCTT CGGAAAAAGA GTTGGTAGCT CTTGATCCGG CAAACAAACC 6550 
ACCGCTGGTA GCGGTGGTTT TTTTGTTTGC AAGCAGCAGA TTACGCGCAG 6600 
AAAAAAAGGA TCTCAAGAAG ATCCTTTGAT CTTTTCTACG GGGTCTGACG 6650 
CTCAGTGGAA CGAAAACTCA CGTTAAGGGA TTTTGGTCAT GAGATTATCA 6700 
AAAAGGATCT TCACCTAGAT CCTTTTAAAT TAAAAATGAA GTTTTAAATC 6750 
AATCTAAAGT ATATATGAGT AAACTTGGTC TGACAGTTAC CAATGCTTAA 6800 
TCAGTGAGGC ACCTATCTCA GCGATCTGTC TATTTCGTTC ATCCATAGTT 6850 
GCCTGACTCC CCGTCGTGTA GATAACTACG ATACGGGAGG GCTTACCATC 6900 
TGGCCCCAGT GCTGCAATGA TACCGCGAGA CCCACGCTCA CCGGCTCCAG 6950 
ATTTATCAGC AATAAACCAG CCAGCCGGAA GGGCCGAGCG CAGAAGTGGT 7000 
CCTGCAACTT TATCCGCCTC CATCCAGTCT ATTAATTGTT GCCGGGAAGC 7050 
TAGAGTAAGT 
CTGCAGGCAT 
TCCGGTTCCC 
AAAAGCGGTT 
CCGCAGTGTT 
GTCATGCCAT 
GTCATTCTGA 
CAACACGGGA 
ATTGGAAAAC 
GAGATCCAGT 
CTTTTACTTT 
GCCGCAAAAA 
CTTCCTTTTT 



GCGGATACAT 
CGCACATTTC 
CATGACATTA 



AGTTCGCCAG 
CGTGGTGTCA 
AACGATCAAG 
AGCTCCTTCG 
ATCACTCATG 
CCGTAAGATG 
GAATAGTGTA 
TAATACCGCG 
GTTCTTCGGG 
TCGATGTAAC 
CACCAGCGTT 
AGGGAATAAG 
CAATATTATT 
ATTTGAATGT 
CCCGAAAAGT 
ACCTATAAAA 




TTAATAGTTT 
CGCTCGTCGT 
GCGAGTTACA 



GCGCAACGTT 
TTGGTATGGC 
TCATCCCCCA 



GTCCTCCGAT 
GTTATGCCAG 
CTTTTCTGTG 
TGCGGCGACC 
CCACATAGCA 
GCGAAAACTC 
CCACTCGTGC 
TCTGGGTGAG 
GGCGACACGG 
GAAGCATTTA 
ATTTAGAAAA 



GCCACCTGAC 
ATAGGCGTAT 



CGTTGTCAGA 
CACTGCATAA 
ACTGGTGAGT 
GAGTTGCTCT 
CAACTTTAAA 
TCAAGGATCT 
ACCCAACTGA 
CAAAAACAGG 
AAATGTTGAA 
TCAGGGTTAT 
ATAAACAAAT 
GTCTAAGAAA 
CACGAGGCCC 



GTTGCCATTG 
TTCATTCAGC 
TGTTGTGCAA 
AGTAAGTTGG 
TTCTCTTACT 
ACTCAACCAA 
TGCCCGGCGT 
AGTGCTCATC 
TACCGCTGTT 
TCTTCAGCAT 
AAGGCAAAAT 
TACTCATACT 
TGTCTCATGA 
AGGGGTTCCG 
CCATTATTAT 
TTTCGTCTTC 



AA 



7100 

7150 

7200 

7250 

7300 

7350 

7400 

7450 

7500 

7550 

7600 

7650 

7700 

7750 

7800 

7850 

7852 



(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9972 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 
TCTTCCGCTT CCTCGCTCAC TGACTCGCTG CGCTCGGTCG TTCGGCTGCG 50 
GCGAGCGGTA TCAGCTCACT CAAAGGCGGT AATACGGTTA TCCACAGAAT 100 
CAGGGGATAA CGCAGGAAAG AACATGTGAG 
AGGAACCGTA AAAAGGCCGC GTTGCTGGCG 
CCCTGACGAG CATCACAAAA ATCGACGCTC 
CGACAGGACT ATAAAGATAC CAGGCGTTTC 
CGCTCTCCTG TTCCGACCCT GCCGCTTACC 
CCCTTCGGGA AGCGTGGCGC TTTCTCATAG 
GTTCGGTGTA GGTCGTTCGC TCCAAGCTGG 
GTTCAGCCCG ACCGCTGCGC CTTATCCGGT 
CCCGGTAAGA CACGACTTAT CGCCACTGGC 
TTAGCAGAGC GAGGTATGTA GGCGGTGCTA 
CCTAACTACG GCTACACTAG AAGAACAGTA 
GAAGCCAGTT ACCTTCGGAA AAAGAGTTGG 
AAACCACCGC TGGTAGCGGT GGTTTTTTTG 
CGCAGAAAAA AAGGATCTCA AGAAGATCCT 
TGACGCTCAG TGGAACGAAA ACTCACGTTA 
TATCAAAAAG GATCTTCACC TAGATCCTTT 
AAATCAATCT AAAGTATATA TGAGTAAACT 
CTTAATCAGT GAGGCACCTA TCTCAGCGAT 
TAGTTGCCTG ACTCCCCGTC GTGTAGATAA 
CCATCTGGCC CCAGTGCTGC AATGATACCG 
TCCAGATTTA TCAGCAATAA ACCAGCCAGC 
GTGGTCCTGC AACTTTATCC GCCTCCATCC 
GAAGCTAGAG TAAGTAGTTC GCCAGTTAAT 
CATTGCTACA GGCATCGTGG TGTCACGCTC 
TCAGCTCCGC TTCCCAACGA TCAAGGCGAG 
TGCAAAAAAG CGGTTAGCTC CTTCGGTCCT 



CAAAAGGCCA 


GCAAAAGGCC 


150 


TTTTTCCATA 


GGCTCCGCCC 


200 


AAGTCAGAGG 


TGGCGAAACC 


250 


CCCCTGGAAG 


CTCCCTCGTG 


300 


GGATACCTGT 


CCGCCTTTCT 


350 


CTCACGCTGT 


AGGTATCTCA 


400 


GCTGTGTGCA 


CGAACCCCCC 


450 


AACTATCGTC 


TTGAGTCCAA 


500 


AGCAGCCACT 


GGTAACAGGA 


550 


CAGAGTTCTT 


GAAGTGGTGG 


600 


TTTGGTATCT 


GCGCTCTGCT 


650 


TAGCTCTTGA 


TCCGGCAAAC 


700 


TTTGCAAGCA 


GCAGATTACG 


750 


TTGATCTTTT 


CTACGGGGTC 


800 


AGGGATTTTG 


GTCATGAGAT 


850 


TAAATTAAAA 


ATGAAGTTTT 


900 


TGGTCTGACA 


GTTACCAATG 


950 


CTGTCTATTT 


CGTTCATCCA 


1000 


CTACGATACG 


GGAGGGCTTA 


1050 


CCAGACCCAC 


GCTCACCGGC 


1100 


CGGAAGGGCC 


GAGCGCAGAA 


1150 


AGTCTATTAA 


TTGTTGCCGG 


1200 


AGTTTGCGCA 


ACGTTGTTGC 


1250 


GTCGTTTGGT 


ATGGCTTCAT 


1300 


TTACATGATC 


CCCCATGTTG 


1350 


CCGATCGTTG 


TCAGAAGTAA 


1400 






GTTGGCCGCA GTGTTATCAC TCATGGTTAT GGCAGCACTG CATAATTCTC 

TTACTGTCAT GCCATCCGTA AGATGCTTTT 

GTGTATGCGG 

CCGCGCCACA 



ACCAAGTCAT TCTGAGAATA 
GGCGTCAATA CGGGATAATA 
TCATCATTGG AAAACGTTCT 
CTGTTGAGAT CCAGTTCGAT 
AGCATCTTTT ACTTTCACCA 
AAAATGCCGC AAAAAAGGGA 
ATACTCTTCC TTTTTCAATA 
CATGAGCGGA TACATATTTG 
TTCCGCGCAC ATTTCCCCGA 
ATTATCATGA CATTAACCTA 
TCTCGCGCGT TTCGGTGATG 
CGGAGACGGT CACAGCTTGT 



GTAACCCACT 
GCGTTTCTGG 



CTGTGACTGG TGAGTACTCA 
CGACCGAGTT GCTCTTGCCC 
TAGCAGAACT TTAAAAGTGC 
TCGGGGCGAA AACTCTCAAG GATCTTACCG 

CGTGCACCCA ACTGATCTTC 
GTGAGCAAAA ACAGGAAGGC 
ATAAGGGCGA CACGGAAATG TTGAATACTC 
TTATTGAAGC ATTTATCAGG GTTATTGTCT 
AATGTATTTA GAAAAATAAA CAAATAGGGG 
AAAGTGCCAC CTGACGTCTA AGAAACCATT 
TAAAAATAGG CGTATCACGA GGCCCTTTCG 
ACGGTGAAAA CCTCTGACAC ATGCAGCTCC 
CTGTAAGCGG ATGCCGGGAG CAGACAAGCC 



CGTCAGGGCG CGTCAGCGGG TGTTGGCGGG 
TGCGGCATCA GAGCAGATTG 

GTTAAAATTC 
AGGCCGAAAT 

GGGTTGAGTG TTGTTCCAGT 
GGACTCCAAC GTCAAAGGGC 



TTAATATTTT 
TTTAACCAAT 
GCCCGAGATA 
TAAAGAACGT 
GATGGCCCAC 
GTGCCGTAAA 
CTTGACGGGG 
AAAGGAGCGG 



TACTGAGAGT 
GCGTTAAATT 
CGGCAAAATC 



TGTCGGGGCT 

GCACCATAAA 

TTTGTTAAAT 

CCTTATAAAT 

TTGGAACAAG 

GAAAAACCGT 



TACGTGAACC 
GCACTAAATC 
AAAGCCGGCG 
GCGCTAGGGC 
CCCGCCGCGC 



GGCTTAACTA 
ATTGTAAACG 
CAGCTCATTT 
CAAAAGAATA 
AGTCCACTAT 
CTATCAGGGC 
TGGGGTCGAG 
CGATTTAGAG 
GAAGAAAGCG 



AACCACCACA 
GTTGCTTTGA CGTATGCGGT 



ATCACCCAAA TCAAGTTTTT 
GGAACCCTAA AGGGAGCCCC 
AACGTGGCGA GAAAGGAAGG 
GCTGGCAAGT GTAGCGGTCA CGCTGCGCGT 
TTAATGCGCC GCTACAGGGC GCGTACTATG 
GTGAAATACC GCACAGATGC GTAAGGAGAA 



1450 

1500 

1550 

1600 

1650 

1700 

1750 

1800 

1850 

1900 

1950 

2000 

2050 

2100 

2150 

2200 

2250 

2300 

2350 

2400 

2450 

2500 

2550 

2600 

2650 

2700 






AATACCGCAT 


CAGGCGCCAT 


TCGCCATTCA 


GGCTGCGCAA 


CTGTTGGGAA 


2750 


GGGCGATCGG 


TGCGGGCCTC 


TTCGCTATTA 


CGCCAGCTGG 


CGAAAGGGGG 


2800 


ATGTGCTGCA 


AGGCGATTAA 


GTTGGGTAAC 


GCCAGGGTTT 


TCCCAGTCAC 


2850 


GACGTTGTAA 


AACGACGGCC 


AGTGCCAAGC 


Tj?aaggtgca 


CGGCCCACGT 


2900 


GGCCACTAGT 


ACTTCTCGAG 


CTCTGTACAT 


GTCCGCGGTC 


GCGACGTACG 


2950 


CGTATCGATG 


GCGCCAGCTG 


CAGGCGGCCG 


CCATATGCAT 


CCTAGGCCTA 


3000 


TTAATATTCC 


GGAGTATACG 


TAGCCGGCTA 


ACGTTAACAA 


CCGGTACCTC 


3050 


TAGAACTATA 


GCTAGCCAAT 


TCCATCATCA 


ATAATATACC 


TTATTTTGGA 


3100 


TTGAAGCCAA 


TATGATAATG 


AGGGGGTGGA 


GTTTGTGACG 


TGGCGCGGGG 


3150 


CGTGGGAACG 


GGGCGGGTGA 


CGTAGGTTTT 


AGGGCGGAGT 


AACTTGTATG 


3200 


TGTTGGGAAT 


TGTAGTTTTC 


TTAAAATGGG 


AAGTTACGTA 


ACGTGGGAAA 


3250 


ACGGAAGTGA 


CGATTTGAGG 


AAGTTGTGGG 


TTTTTTGGCT 


TTCGTTTCTC 


3300 


GGCGTAGGTT 


CGCGTGCGGT 


TTTCTGGGTG 


TTTTTTGTGG 


ACTTTAACCG 


3350 


TTACGTCATT 


TTTTAGTCCT 


ATATATACTC 


GCTCTGCACT 


TGGCCCTTTT 


3400 


TTACACTGTG 


ACTGATTGAG 



CTGGTGCCGT 


GTCGAGTGGT 


GTTTTTTTAA 


3450 


TAGGTTTTCT 


TTTTTACTGG 


TAAGGCTGAC 


TGTTAGGCTG 


CCGCTGTGAA 


3500 


GCGCTGTATG 


TTGTTCTGGA 


GCGGGAGGGT 


GCTATTTTGC 


CTAGGCAGGA 


3550 


GGGTTTTTCA 


GGTGTTTATG 


TGTTTTTCTC 


TCCTATTAAT 


TTTGTTATAC 


3600 








GCCTGCGGGT 


ATGTATTCCC 


3650 



CCCAAGCTTG 


CATGCCTGCA 


GGTCGACTCT 


AGAGGATCCG AAAAAACCTC 


3700 


CCACACCTCC 


CCCTGAACCT 


GAAACATAAA 


ATGAATGCAA 


TTGTTGTTGT 


3750 


TAACTTGTTT 


ATTGCAGCTT 


ATAATGGTTA 


GAAATAAAGC 


AATAGCATCA 


3800 


CAAATTTCAC 


AAATAAAGCA 


TTTTTTTCAC 


TGCATTCTAG 


TTGTGGTTTG 


3850 


TCCAAACTCA 


TCAATGTATC 


TTATCATGTC 


TGGATCCCCC 


TAGCTTGCCA 


3900 


AACCTACAGG 


TGGGGTCTTT 


CATTCCCCCC 


TTTTTCTGGA 


GACTAAATAA 


3950 


AATCTTTTAT 


TTTATCTATG 


GCTCGTACTC 


TATAGGCTTC 


AGCTGGTGAT 


4000 
ATTGTTGAGT 
AATTGGACTA 
CAACATTTAT 
TCTCCTCTTT 
TTCCGGTGGG 
GAAGAGGCTC 
CTTTGTTCTC 
ATCCTGTGTT 
AGTTCTTCTA 



CAAAACTAGA 
ATCGCGGGAT 
GCTGCTCTCT 
CAGAGCAGCA 
GAAAGAGCTT 



GCCTGGACCA CTGATATCCT 
CAGCCAATTC CATGAGCAAA 
AAAGCCTTGT ATCTTGCATC 
ATCTGGGGCT TAGACTTGCA 
CACCCTGTCG GAGGGGCTGA 



CTCTCGTTCA GCAGTTTCTG 

AAAAATTGTT 



TTCTATGACC 
CACAGAGAAT 
ATTATTTGGT 



GATGGAATCG 
GGCATTCCAG 



GTCTTTAACA 

TGTCCCATGT 

TCTTCTTCTG 

CTTGCTTGAG 

TGGCTTGCCG 

TACTGCCGCA 

CATTGCTTCT 



TACTGTGCAA TCAGCAAATG 
ATGTTACTGG ATCCAAATGA 



CATCAAGCAG CAAGATCTTC GCCTTACTGA GAACAGATCT 
ATCAACTGCT TGTGGCCATG 



AAAGTCAAGC 
CATCTGCAAC 
AAGTTTTTTC 
TCCAAAGGCT 
CATCGATCTG 
AACAAAGTAC 
GCCAGGACTT 



TTCCCAGGAA 

TTTCCATATT 

TAAATGTTCC 

TTCCTCCACT 

GATTTCTCCT 

TCTTCCCTGA 

ATTGAGAAGG 



GCTTAGGACA CAGCCCCCAT 
ACTGTTCTAT CACAGATCTG 
TCTTGATCAC TCCACTGTTC 
AGAAAAAATA AATACTTTCT 
GTTGCAAAGT TATTGAATCC 
TCAGTGTTCA GTAGTCTCAA 
TCCAGTTCTT CCCAAGAGGC 
AAATGTTCTC TAATATGGCA 



CTGTGTATTT 
CAGATGTCAT 
GAGTTGGCCA 
TTGGCATGTC 



TGCTGTGAGA TCTTTGACAG 
CTTTCTTCAC GTGTGAATTC 



TTCTTGTATG 
AATGAACTTA 



GTTTGGTTGA 
AAGACTCGGC 



TCATTTGGCC 
TCAATAATCA 
CTTGGTAGGT 



TCCACATCTA TGCTGGAGTT TACAGCCCAC 
CATGGCTAAA GTCAGGATAA TACCAACTCT 



TTAAAATGGA 
TCTATTCTCA 



AATGAAGGTA 
TTTGGAACCA 



ACAGCAATGA 
GCGCAGTGTT 



TCACAGATCG 
TGCAATGTAC 
TCCTTCTCCT 
AGAAGATGAC 
GACAGGTACA 



CTTGTTTTAG 
GCACTGGGTT 
AGCCAAGCAC 
CCACAAGGAC 
AGCCCAACCT 
ATAGGGATCC 
GTGGTATCAC 
CAAGACACAC 
AAAAGCTGAT 
CCACCCTCTG 
TTTCCACCTT 
CCCTGAGGGC 
TAACTTTCGA 
TTACCTTCTG 
CATCAAGCTA 
TCATGATATT 



TCTCCTGTTG 
AAAAATCATT 
AGAACCAGTT 



4050 
4100 
4150 
4200 
4250 
4300 
4350 
4400 
4450 
4500 
4550 
4600 
4650 
4700 
4750 
4800 
4850 
4900 
4950 
5000 
5050 
5100 
5150 
5200 
5250 
5300 
GGCAGTATGT AAATTCAGAG CTTTGTGGAA 
GCCGTCCGAA GGCACGAAGT GTCCATAGTC 
TGAGTGAAAA TTGGACTCCT GCCTTCAGAT 
TGAGGTTTGG AGGAAATATG CTCTCAACAT 
GCACTGTTGC AACAAAGATG TAGGGTTGTA 
GCTCCAATCA CAATTAATAA CAACTGGATG 
CAGAAGGTCA TCCAAAATTG CTATATCTTT 
CACCTGCTTT CAACGTGTTG AGGGTTGACA 
TGTAACATTT TGTGGTGTAA AATTTTCGAC 
CAGTGGTAGA CCTCTGAAGA ATCCCATAGC 
CCACGTAAAT GTAAAACACA TAATACGAAC 
GCATAGCTGT TATTTCTACT ATGAGTACTA 
AGTGTTTCCA AGGAGCCACA GCACAACCAA 
GAAAAATTAC TAAGCACCAA ATTAGCACAA 
GTAATATATC GAAGGTATGT GTTCCATGTA 
CATATCATCA AAAAAGCACT CCTTTAAGTC 
TTATTTCCAA GCCAGTTTCT TGAGATAACC 
TCAGTCAAGT TTGCCTGAGG GGCCAGTGAC 
TGTCTTTCGG TGAATGTTCT GACCTTGGTT 
TCAGGACAGA CTGCCTCCTT CGTGCCTGAA 
ACGCTGATGC GAGGCAGTAT CGCCTCTCCC 
GGACAGCCTT CTCTCTAAAG GCTCATCAGA 
TTTGTAAGGG AGTCTTTTGC ACAATGGAAA 
GGATTGAGAA TAGAATTCTT CCTTTTTTCC 
AAAAGATTGT TTTTTTGTTT CTGTCCAGGA 
ATGAGAAACG GTGTAAGGTC TCAGTTAGGA 
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CAGAGTTTCA 


AAGTAAGGCT 


5350 


CTTTTAAGCT 


TGTAACAAGA 


5400 


TCCAGTTGTT 


TGAGTTGCTG 


5450 


AATAAAAGCC 


ACTATCACTG 


5500 


AAACTGCGAC 


AACTGCTATA 


5550 


AAGTGAAATA 


TGGTAAGAGG 


5600 


GGAGAATCTA 


TTAAGAATCC 


5650 


TAGGTGCTTG 


AAGAACAGAA 


5700 


ACTGTGATTA 


GAGTATGCAC 


5750 


AAGCAAAGTG 


TCGGCTACTC 


5800 


TGGTGCTGGT 


GATAATCACT 


5850 


TTCCCTTTGT 


CTTGAAGAGG 


5900 


AGAAGCAGCC 


ACCTCTGCCA 


5950 


AAATTAAGCT 


CTTGTGGACA 


6000 


GTCACTGCTG 


GTATGCTCTC 


6050 


TTCTTCGTTA 


ATTTCTTCAC 


6100 


TTCTTGAATA 


TATATCCAGT 


6150 


ACTTTTCGTG 


TGGATGCTGT 


6200 


AACTGAGTGT 


GTCATCAGGT 


6250 


GCGTGGGGCC 


AGTGCTGATC 


6300 



TGCTCAGAAT 


CTGGTACTAA 


6350 


ATCCTCTTCG 


ATGCCATTCA 


6400 


ATTTTCGTAT 


AGAGTTGATT 


6450 


CCAAACTCTC 


CAGTCTGTTT 


6500 


GACAGGAGCA 


TCTCCTTCTA 


6550 


TTGAATTTCT 


TCTTTCTGCA 


6600 



WO 96/13597 



PCT/US95/14017 



78 



CTAAATTGGT 
CTGTAGATTT 
CATGCAAAAT 
GTGACCAAAA 
AAATATTTCT 
ATAAATACAA 
GCTCGTTGAC 
GTCTTTCTCT 
TGACGCTTCT 
TCTTTAATGG 
TCTTCCACTG 
TCATCATTAG 
AACAACTGTC 
ACCAAGAAGT 



CGAAAGAATC 
TGGAGTTCTG 



TAATATTTTG 



TTTTTCTATT 
CCCTCCTCCC 
TAAGTTATAT 
TTTTGTTTAT 
AATTGCCGAG 
GAATATTTTC 
ATAAAAACAC 
TATCTCACAT 
GTTTTCAATC 



ATTGGATGTT 
TTCCCAGCTC 
CTGAAAAAGG 



TCCTAGTTTT 
TTTTCTGTTA 
ATCAGCATCT 
CTCCACTCAG 
GCAAACTTGG 
GTATCTATAT 
TGCCAGGCAT 
TGCTTAATTT 
AAGTGAAGTC 
CTCTTTCTAT 
GAGAAATTAC 
GTTATTGTTT 
AGAAGGCTGT 
TCCAATGTCT 
TGCTCCAAGA 
TGACCGCCAT 
CGGAGGATGA 
CACAAAGAAC 
AGGCTGCCTT 
ATTTTTTCCA 
CTCGATCATT 
TCTGATCTCT 
GCAAGGACTA 



ACATCCCATG 
AAAATGTCCC 
TCAGCTTTCT 
GTTAGCCATC 
AAACATCTAG 
TTGTATACTG 
TGTGATTCCA 
AGATGTCCTC 
TCATCATAGG 
AATCCAGGAA 
TACCCTCTGA 
TTGCCTGCTC 
CTTGAAATTA 
TGAAGAAGAG 
TGTTTTGCTT 
TACATTCTCC 
TATATTCTTG 
GAGTCATACC 
GCGCAGAACA 
TTCCTTTGAT 
CCTGAGAAGA 
CCGAGTCAGT 
TTGCTTCTTC 
TCTGAGGTAA 
GTACTTCATC 
TCAGGAAACC 



AGTTTTGAGC TAAAGTCTGG 

CTGCTACCTT 



ATAAAAATAG 
TTAAATGTTC 
AGTTTACAGA 
GTATCCAAAA 
CTCTTGCTAA 
CCTTCTCCAA 
TTCTAGTTGG 
AAACACCAAA 
AACTGAGAAC 
AGGCTCCAGT 
CAGTGGATCC 
ATATCTTTCA 
GCTGTCATCA 
TCTCAAATAA 
ATCACTACTT 



CATTTTAGAA 



CTTTTGTAAG 



ATGTTTGTAC 
ATGCAGAATG 
TAGTGCATAG 
AGAAGGCTGA 
TTCAGTTCTG 
CCAGCAGTAT 
TCACAAGTCT 
ATCATTCTCC 
AAGTCCACAG 



CACAGCTTTC 
GGAGAGTCTA 
AGAAATTCTT 
GAACTATATT 
CATGCTTTGA 
GATGATATTT 
AGAATGAAAT 
TCTCCCATAA 
AGCAACCGCC 
GGACAGGAGT 
CCATTAGAAG 
TTCCCCAAAT 
CTGTAGTCGT 
AAATCCTGTA 
AGCCCAGGGA 
AGATGGTGGT 
GGAAGCACAG 
GCTATTGAAG 
TTTGTCTTAA 
GCCTTAACAG 
TTCACTGATC 
CTAGCCCAGC 



AAGGCAGACG 
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6750 
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CCTGTAACAA CTCCCAGATT AGCCCCATGA 
GCGATCCACA CGAAATGTGC CAATGCAAGT 
GTTGTTGGAA AGGAGACTAA CAAGTTGTCC 
CACGGCTTGA CAGCTTTAAA GTCTTCTTAT 
ATTCTCATCT GCATTCCAAT GTGATGAAGG 
GAGCAGTGTC CTCACAATAA AGAGAAGGCA 
TCGCGATAGA GCGTTCCTCC TTGTTATCCG 
CTTCCCAGTA AGAGAGGCTG TACTGCTTTG 
AAAGATTCCA TAGAACATAA ATCTCCAGAA 
TAATGAGTTT AGGATTTTTC TTTGAAGCCA 
TCCAATTTTT CAGATAGATT GTCAGCAGAA 
TATGTCTGAC AATTCCAGGC GCTGTCTGTA 
TGGTCCAGCT GAAAAAAAGT TTGGAGACAA 
GACCTCTGCA TGGTCTCTCG GGCGCTGGGG 
GCTCAAGCTC CTAATGCCAA AGGAATTCCT 
GTTCTAGAGC GGCCGCCACC GCGGTGGCTG 
GCGCTTCGCT TTTTATAGGG CCGCCGCCGC 
AAACTTTCGG AGCGCGCCGC TCTGATTGGC 
TCGCCCCGCC CCGCCCCTCG CCCCGCCCCG 
CCCCCCCCCC CCGCCCCCAT CGCTGCACAA 
ATACAAAATT GGGGGTGGGG AGGGGGGGGA 
ACGTGGCCTC GAGTAGATGT ACTGCCAAGT 
TGTACTGGGC ATAATGCCAG GCGGGCCATT 
GGGGGCGTAC TTGGCATATG ATACACTTGA 
TTACCGTAAA TACTCCACCC ATTGACGTCA 
TTACTATGGG AACATACGTC ATTATTGACG 
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GGAGTGCCAC 


TTGCAAAGGA 


7950 


CCTTCATCAA 


ATTTGTTCAG 


8000 


AATACTTATT 


TTATCTAGAA 


8050 


AAATCAAACT 


AAACATAGCT 


8100 


CCAAAAATGG 


CTGGGTGTAG 


8150 


TAAGCCTATG 


CCTAGATAAA 


8200 


GGTCATAGGA 


AGCTATGATT 


8250 


GTGACTTCCC 


CTAAATATAA 


8300 


AAAACATCGC 


CGAAGGGCAT 


8350 


GCTCTCTATC 


CCATTCTCTT 


8400 


TCAACAGAAG 


GGATTTGGTA 


8450 


TCCTTTCCTC 


AAAATTGGTC 


8500 


CGCTGGCCTT 


TTCCAGAGGC 


8550 


TCCCTGCTAG 


GGCCGTCTGG 


8600 


GCAGCCCGGG 


GGATCCACTA 


8650 


ATCCCGCTCC 


CGCCCGCCGC 


8700 


CGCCGCCTCG 


CCATAAAAGG 


8750 


TGCCGCCGCA 


CCTCTCCGCC 


8800 


CCCCGCCTGG 


CGCGCGCCCC 


8850 


AATAATTAAA 


AAATAAATAA 


8900 


GATGGGGAGA 


GTGAAGCAGA 


8950 


AGGAAAGTCC 


CATAAGGTCA 


9000 


TACCGTCATT 


GACGTCAATA 


9050 


TGTACTGCCA 


AGTGGGCAGT 


9100 


ATGGAAAGTC 


CCTATTGGCG 


9150 


TCAATGGGCG 


GGGGTCGTTG 


9200 






GGCGGTCAGC CAGGCGGGCC ATTTACCGTA AGTTATGTAA CGACCTGCAG 
GCTGATCTCC CTAGACAAAT ATTACGCGCT ATGAGTAACA CAAAATTATT 
CAGATTTCAC TTCCTCTTAT TCAGTTTTCC CGCGAAAATG GCCAAATCTT 
ACTCGGTTAC GCCCAAATTT ACTACAACAT CCGCCTAAAA CCGCGCGAAA 
ATTGTCACTT CCTGTGTACA CCGGCGCACA CCAAAAACGT CACTTTTGCC 
ACATCCGTCG CTTACATGTG TTCCGCCACA CTTGCAACAT CACACTTCCG 
CCACACTACT ACGTCACCCG CCCCGTTCCC ACGCCCCGCG CCACGTCACA 
AACTCCACCC CCTCATTATC ATATTGGCTT CAATCCAAAA TAAGGTATAT 
TATTGATGAT GCTAGCATGC GCAAATTTAA AGCGCTGATA TCGATCGCGC 
GCAGATCTGT CATGATGATC ATTGCAATTG GATCCATATA TAGGGCCCGG 
GTTATAATTA CCTCAGGTCG ACGTCCCATC GCCATTCGAA TTCGTAATCA 
TGGTCATAGC TGTTTCCTGT GTGAAATTGT TATCCGCTCA CAATTCCACA 
CAACATACGA GCCGGAAGCA TAAAGTGTAA AGCCTGGGGT GCCTAATGAG 
TGAGCTAACT CACATTAATT GCGTTGCGCT CACTGCCCGC TTTCCAGTCG 
GGAAACCTGT CGTGCCAGCT GCATTAATGA ATCGGCCAAC GCGCGGGGAG 
AGGCGGTTTG CGTATTGGGC GC 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

TAGTAAATTT GGGC 
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9300 

9350 
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9450 

9500 

9550 

9600 

9650 

9700 

9750 

9800 

9850 

9900 
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(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
AGTAAGATTT GGCC 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
AGTGAAATCT GAAT 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GAATAATTTT GTGT 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
CGTAATATTT GTCT 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
WANWTTTG 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19307 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CCAATTCCAT CATCAATAAT ATACCTTATT TTGGATTGAA GCCAATATGA 
TAATGAGGGG GTGGAGTTTG TGACGTGGCG CGGGGCGTGG GAACGGGGCG 
GGTGACGTAG GTTTTAGGGC GGAGTAACTT GTATGTGTTG GGAATTGTAG 
TTTTCTTAAA ATGGGAAGTT ACGTAACGTG GGAAAACGGA AGTGACGATT 
TGAGGAAGTT GTGGGTTTTT TGGCTTTCGT TTCTGGGCGT AGGTTCGCGT 
GCGGTTTTCT GGGTGTTTTT TGTGGACTTT AACCGTTACG TCATTTTTTA 
GTCCTATATA TACTCGCTCT GCACTTGGCC CTTTTTTACA CTGTGACTGA 
TTGAGCTGGT GCCGTGTCGA GTGGTGTTTT TTTAATAGGT TTTCTTTTTT 
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ACTGGTAAGG CTGACTGTTA 
CTGGAGCGGG AGGGTGCTAT 
TTATGTGTTT TTCTCTCCTA 
TAATGTTGTC TCTACGCCTG 
CTGCAGGTCG ACTCTAGAGG 
AACCTGAAAC ATAAAATGAA 
AGCTTATAAT GGTTACAAAT 
AAGCATTTTT TTCACTGCAT 
GTATCTTATC ATGTCTGGAT 
CCCGGGCTGC AGGAATTCCG 
CAGTAAAGCA GTAATATAAT 
TGATATGTTG TGAAAATGCA 
AAATGTTACA GTGTTGGTGT 
AGAGTCCAGT ACCTGGAGAC 
CTTCAGTTAC ACTGATTATG 
GAACATGAAA TGATGTCCAA 
AGTTTTTATT CAAATATTTT 
AGATAAAACG AAAAGATTAA 
TTTTAGAATG AAACTTAAAA 
AAATCTTGGT GAAAACAAAT 
TAAAGGAGAG AGAGAGAGAA 
TATCTTAGAG CTTTGGGTTT 
ACTGGTGTCC ACACAACAGA 
ACAATTACTA GAAACACCCC 
AAGCTGTGTT TGATGTTAAT 
TAGAAGTTAA TTACACTTGA 



GGCTGCCGCT GTGAAGCGCT 
TTTGCCTAGG CAGGAGGGTT 
TTAATTTTGT TATACCTCCT 
CGGGTATGTA 1VCCCCCCAA 
ATCCGAAAAA ACCTCCCACA 
TGCAATTGTT GTTGTTAACT 
AAAGCAATAG CATCACAAAT 
TCTAGTTGTG GTTTGTCCAA 
CCCCGCGGCC GCTCTAGAAC 
TAACATAACT GCGTGCTTTA 
ACAATAGTAA GGCATATATT 
GTAAAACTGA AGTTTAAAAA 
TAAAACACAA TCTATTATGA 
AATGATGATA CATGCCATGT 
ATTTACACTT TAATACTTGA 
ATTATGCTTA AAATCAGCAA 
GATAGATTCA CTCCAGAACT 
AACAAAACTA TGCACTCTAT 
CTTCTTAGTA GGAAAGGAAC 
CCTTGGATAA AGAAAATGCC 
AAGCAAGACC AGAACCAAAT 
TCTTTTGGAA ATTATAAATG 
CAAGTGGTGA AGTTGTGAAA 
AAAACCAAAG TGAGGTAGAA 
TACAATTAAT AATGGACAAA 
CGTTAGAGGT AACAGATTTG 
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GTATGTTGTT 


450 


TTTCAGGTGT 


500 


ATGGGGGCTG 


550 


GCTTGCATGC 


600 


CCTCCCCCTG 


650 


TGTTTATTGC 


700 


TTCACAAATA 


750 


ACTCATCAAT 


800 


TAGTGGATCC 


850 


TTGAGATACA 


900 


TGGTGAAATC 


950 


AATAATTAGT 


1000 


TACTCAAGTA 


1050 


GATGATTATG 


1100 


TGGTTATAAA 


1150 


TAAAGCTCTC 


1200 


AATATCTAAA 


1250 


CTACCTTGGA 


1300 


CCCTTGTTTT 


1350 


CAGTGCCACA 


1400 


TTCAATTTGT 


1450 


AAAAAAGGAA 


1500 


TTAGGTGTGC 


1550 


ATAGCATGAG 


1600 


ACCCACTCGC 


1650 


CAAAATGATA 


1700 
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GGACAGTGAT TTCTATTGAG 
ACTGGCATGA GAGGAGTAAA 
GCACTTTTTC TCCTGGTTCA 
CGTCAACTAG ACCAGAGAGT 



AACCACTGTG 
AACCAATGCA 
CACCAGGGTC 
CTTGCAGCAT 
AAAATCTAAA 
CAACAACAAA 
ATTTAGTTGT 
CCATTCTTTT 



AAGGAGACAA 
ACAATGTGAT 
TAAAAAAAAA 
ATCTAAAATT 
ACCACCCACT 
AATTATAGAG 
ACAGAGTTGT 




AGAATGCTCT TTAAATGCTA AGAAGAAGAA 
GCTCTTCCTA GCAGTCCTTA GCTTTCTGTT 
ATGACTTGCA TTTGTTTAGA CATTTCAGCC 
TTGGAGACGC Tl rTGCTCTC AAAACTTTCC 

CTTGCAGGGA 

GACTGATATT 
AATCCTAAGT 
AAAAACCCAA 
ACAACAACAA 
GGGCAGAACA 



CCTTCTCACC CACAATCCTG TGTGGAGTTA 

ATGCAGTTCA TGGGCTTCTG 



CGCAACCTTT AGCTCCTACC 



TGTCATCTGA 
AGGAAGTTGA 
TATTCATTCG 



TGTTCTATTG 
ATATATCAGC 
CATGTTCCTT 



TGGGTTACTT TCTTAACAGT 
AATCATCACA ATGAAGAAAA 
CATCATCATC ATCAACAACA 
TCAGGTTGAG TTTATGAAGA 
ATGTTTATAT GTATAGTTGT 
TGCTCCCCTC ATATAAATTG 
ATCTTCCTCC TACTGTCTGG 
CAGAAACATC ATTAAATATA 
CAACAAATTA CTATGATAGT 



AAAGTTTAAA ACTAGAAACA 



CGGTAGTCTC 
AGATAAGCCC 
ATTTGAAGAA 
TATACTTGTA 
ACTCAGTCAA 
GTTCGTATGC 
CGCTAAAGTA 
TGTGTGTGTG 



GTGTGTGTGT 



CTGGCTTTGG 
AGATGACTAG 
GTAACTTCAT 
GACATATAGA 
CAATTCTCAA 
AAGAAAATAA 
ATGCAAAACA 
GTGGGTTCGT 
GTGTGTGTGC 



GAAAAAAATG AATCCTCTAG 
TCTGGAGCCC TAGACAATAT 
GCTCCAGGGA AAATTCACTC 
AAGCAATTTC CATTAGGAAG 
ATCTATTTAT CTATATACCT 
TGTATAAAAT GAAAGCCCAT 
AAGAGCAATA TGAAGCAGTC 



AAAAACGTCA TGAATTCCAT 
ATGTGCTGCC TCAGTGTGTG 
GCATGTATGT GTGCGTGTGT 
GTGTGTGTTT GTTTAGGGGT TTTTATAAAC 



AAATATTCAT 
ACTGAGGAGC 
GAGTTAAAAA 
ACCCAACAGT 
AAGTCCTGTG 
CTCTCAGTGG 
TTTAGTGTGG 
TTGCCCAAGC 
TGGCAAGAAC 
ATAGTATTTA 
AGCCAGCCCC 
ATTTGGTGGG 
ATGAATACCA 
TGTGTGTGTG 
GTGTGTGTGT 



1750 

1800 

1850 

1900 

1950 

2000 

2050 

2100 

2150 

2200 

2250 

2300 

2350 

2400 

2450 

2500 

2550 

2600 

2650 

2700 

2750 

2800 

2850 

2900 

2950 

3000 
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AACTTTTTTT ATAAAGCACA CTTTAGTTTA 
TATAAATTTT TAAACAACCC AAAATGCGTT 
TTATTTAGCT ATCAAGATTT TACATGTTTT 
TTGCATAGAC GTGTAAAACC TGCCATTGTT 
AGAAACTACT GAAATCTACA GTATAGTACC 
AGATTTTATT TCTTGTAAAC TCTTACTGTC 
ATATTATAAA AACCATGCGG GAATCAGGAG 
TCCTTCTTCA TCTGTCATGA CTGAAACTAA 
AATCATCTGC CATGTGGAAA AGGCTTCCTA 
GCTTTCCGGG GGCATTTCTT CCTCTTGAAC 
TGCTCCATCA CTTCTTCTAA CCCTGTGCTT 
AAGATCTTCC TCACCCATAG ATTCTGAAGT 
GCAGCATAGG CTGACTGCTA TCTGACCTCT 
GACACCGTGG TGCCATTCAC CTTAGCTTCA 
CTGTCTCAGT CTATGTAACT GAGACTCCAG 
GGATTTGCAT CCTGGCTTCC AGGCGTCCTT 
GCCTCAGCAA TGAGCTCAGC ATCCCTGGGA 
CATCTCAGGA GGAGATGGCA GTGGAGACAG 
GCTTCAGGCG ATCATATTCT GCTTGCAGAT 
TCTGCTAGGA TTCTCTCTAG CTCCCCTCTT 
CAAGATCTGG GCAGGACTAC GAGGCTGGCT 
AACTTTGGCA GTAATGCTGG ATTAACAAAT 
TTAGGAGAGA TGCTATCATT TAGATAAGAT 
TGCTAGCCTG CTAGCATAAT GTTCAATGCG 
AAAGCTGGGG GGACGAGGCA GGCGCAGAAT 
AGAGTAACGG GAGTTTCCAT GTTGTCCCCC 
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CAATCTCTCT 


TTATAACTGT 


3050 


CCATATAAAG 


AAATGGCAAG 


3100 


CTTTTAACTT 


TTTTGTACAA 


3150 


AACAAAACAA 


TAACAGACTT 


3200 


ACTACCCTTC 


ACAAAAATAT 


3250 


TAATCCTCTT 


TGTTGTACGA 


3300 


TTGTAAAACA 


TTTATTCTGC 


3350 


GGACTCCATC 


GCTCTGCCCA 


3400 


CATTGTGTCC 


TCTCTCATTG 


3450 


TAGGGAAGGA 


GTTGTTGAGT 


3500 


GTGTCCTGGG 


GAGGACTCAG 


3550 


TTGACTGCCA 


ACCACTCGGA 


3600 


GCAGAGAGGT 


GGAAGGAGAG 


3650 


GCCTGGGGCT 


GCTCCAGGAG 


3700 


CTGTTTATTG 


TGGTCTTCCA 


3750 




CAGTAGCTTA 


3800 


CTCTGAGGAG AGGTGGG CAT 


3850 


GCCTTTATGC 


TCATGCTGCT 


3900 


TCCTGTTTTC 


TTCCTCAAGA 


3950 


TCCTCACTCT 


CTAAGGAAAT 


4000 


CAGGGGGGAG 


TCCTGGTTCA 


4050 


GTTCATCATC 


TATGCTCTCA 


4100 


CCATTGCTGT 


TTTCCATTTC 


4150 


TGAATGAGTA 


TCATCGTGTG 


4200 


CTACTGGCCA 


GAAGTTGATC 


4250 


TCTAACACAG 


TCTGCACTGG 


4300 
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CAGGTAGCCC ATTCGGGGAT GCTTCGCAAA ATACCTTTTG GTTCGAAATT 
TGTTTTTTAG TACCTTGGCG AAGTCGCGAA CATCTTCTCC GGATGTAGTC 
GGAGTGCAAT ACTCTACCAT GGGGTAGTGC ATTTTATGGC CCTTTGCAAC 
TCGGCCAGAA AAAAAGCAAC TTTGGCAGAT GTcATAATTA AAATGCTTTA 
GGCTTCTGTA CCTGAATCCA ATGATTGGAC ACTCCTTACA GATGTTACAC 
TTGGCTTGAT GCTTGGCAGT TTCAGCAGCA GCCACTCTGT GCAAGACGGG 
CAGCCACACC ATAGACTGGG GTTCCAGGCG CATCCAGTCA AGGAAGAGAG 
CAGCTTCAAT CTCAGGTTTA TTATTGGCAA ATTGGAAGCA GCTCCTGACA 
CTCGGCTCAA TGTTACTGCC CCCAAAGGAA GCAACTTCAC CCAACTGTCT 
TGGGATTTGA ATAGAATCAT GCAGAAGAAG ACCCAGCCTA CGCTGGTCAC 
AAAAGCCAGT TGAACTTGCC ACTTGCTTGA AAAGGTATCT GTACTTGTCT 
TCCAAGTGTG CTTTACACAG AGAAATGATG CCAGTTTTAA AAGACAGGAC 
ACGGATCCTC CCTGTTCGTC CCGTATCATA AACATTGAGA AGCCAGTTGA 
GACACATATC CACACAGAGA GGGACATTGA CCAGATTGTT GTGCTCTTGC 
TCCAGACGAT CATAAATTGT AGTCAAACAG TTAATTATCT GCAGGATATC 
CATGGGCTGG TCATTTTGCT TGAGGTTGTG CTGGTCCAGG GCATCACATG 
CAGCTGACAG GCTCAAGAGA TCCAAGCAAA GGGCCTTCTG GAGCCTTCTG 
AGCTTCATGG CAGTCCTATA CGCGGAGAAC CTGACATTAT TCAGGTCAGC 
TAAAGACTGG TAGAGCTCTG TCATTTTGGG GTGGTCCCAA CAAGTGGTTT 
GGGTCTCGTG GTTGATATAG TAGGGCACTT TGTTTGGTGA GATGGCTCTC 
TCCCAGGGAC CCTGAACTGA AGTGGAAAGG AAGTGCTGGG ATGCAGGACC 
AAAGTCCCTG TGGGCTTCAT GCAGCTGTCT GACACGGTCC TCCACAGCCA 
CCTGTAGAAG CCTCCATCTG GTATTCAGAT CTTCCAAAGT GCTGAGGTTA 
TAAGGTGAGA GCTGAATGCC CAGTGTGGTC AGCTGATGTG CAAGGTCATT 
GACACGATTG ACATTCTCTT TAAGAGGTGC AATTTCTCCC CGAAGTGCCT 
TGACTTTTTC AAGGTGATCT TGCAGAGAGT CAATGAGGAG ATCCCCCACT 
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GGCTGCCAGG 


ATCCCTTGAT 


CACCTCAGCT 


TGGCGCAACT 


TGAGGTCCAG 


5650 


TTCATCGGCA 


GCTTCCTGAA 


GTTCCTGGAG 


TCTTTCAAGA 


GCTTCATCTA 


5700 


TTTTTCTCTG 


CCAATCAGCT 


GAGCGCAGGT 


TCAATTTGTC 


CCATTCAGCG 


5750 


TTGACCTCTT 


CAGCCTGCTT 


TCGTAGGAGC 


CfcAGTGACAT 


TCTGAGCTCT 


5800 


TTCTTCAGGA 


GGCAGTTCTC 


TGGGCTCCTG 


GTAGAGTTTC 


TCTAGTCCTT 


5850 


CCAAAGGCTG 


CTCTGTCAGA 


AATATTCTCA 


CAGTCTCCAG 


AGTACTCATG 


5900 


ATTACAGGTT 


CTTTAGTTTT 


CAATTCCCTC 


TTGAAGGCCC 


TATGTATATC 


5950 


ATTCTGCTTC 


TGAACTGCTG 


GGAAATCACC 


ACCGATGGGT 


GCCTGACGGC 


6000 


TCAGTTCATC 


ATCTTTCAGC 


TGTAGCCAAA 


CAAGAAGTTC 


CTGAAGAGAA 


6050 


AGATGCAAAC 


GCTTCCACTG 


GTCAGAACTT 


GCTTCCAAAT 


GGGACCTAAT 


6100 


GTTGAGAGAC 


TTTTTCTGAA 


GTTCACTCCA 


CTTGAAATTC 


ATGTTATCCA 


6150 


AACGTCTTTG 


TAACAGGGGT 


GCTTCATCCG 


AACCTTCCAG 


GGATCTCAGG 


6200 


ATTTTTTGGC 


CATTTTCATC 


AAGATTGTGA 


TAGATATCTG 


TGTGAGTTTC 


6250 


AATTTCTCCT TGGAGATCTT 


GCCATGGTTT 


CATCAGCTCT 


CTGACTCCCC 


6300 


TGGAGTCTTC 


TAGGAGCTTC 


TCCTTACGGG 


AAGCGTCCTG 


TAGGACATTG 


6350 


GCAGTTGTTT 


CTGCTTCCGT 


AATCCAGGAA 


AGAAACTTCT 


CCAGGTCCAG 


6400 


AGGGAACTGC 


TGCAGTAATC 


TATGAGTTTC 


TTCCAAAGCA 


GCCTCTTGCT 


6450 


CACTTACTCT 


TTTATGAATG 


TTTCCCCAAG 


AAGTATTGAT 


ATTCTCTGTT 


6500 


ATCATGTGTA 


CTTTTCTGGT 


ATCATCAGCA 


GAATAGTCCC 


GAAGAAGTTT 


6550 


CAGTGCCAAA 


TCATTTGCCA 


CGTCTACACT 


TATCTGCCGT 


TGACGGAGGT 


6600 


CTTTGGCCAA 


CTGCTTGGTT 


TCTGTGATCT 


TCTTTTGGAT 


TGCATCTACT 


6650 


GTGTGAGGAC 


CTTCTTTCCA 


TGAGTCAAGC 


TTGCCTCTGA 


CCTGTCCTAT 


6700 


GACCTGTTCG 


GCTTCTTCCT 


TAGCTTCCAG 


CCATTGTGTT 


GAATCCTTTA 


6750 


ACATTTCATT 


CAACTGTTGT 


CTCCTGTTCT 


GCAGCTGTTC 


TTGAACCTCA 


6800 


TCCCACTGAA 


TCTGAATTCT 


TTCAATTCGA 


TCAGTAATGA 


TTGTTCTAGC 


6850 


TTCTTGATTG 


CTGGTTTTGT 


TTTTCAAATT 


CTGGGCAGCA 


GTAATGAGTT 


6900 
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CTTCCAATTG 
TTGATGATCA 
TGATTTTATA 
AAGCTCGGTT 
GGCATTTCTA 
CACTAGAGTA 
GGGCACGGTC 
GCCTCCCACT 
GCTTGGTTTT 
CATCCGCTTG 
GGTCCTGCCT 
AGAGACCCAC 
CTTTTAAGTG 



ACCTTTATCC 
TTGCTCTTCT 
CATTTAATTG 
AACTTGACTT 
AGTAATAGCA 
AATCTCTTTG 
CTCCTTTCTG 
TCCTAATTTT 



GGGGCGTCTC 
TTTCATTGAT 
ACTCGATCAA 
GAAGTCTGCC 
GTTTGGAGAT 
ACAGTCTGAC 
AGGCTGCTTT 
CAGACCTCAG 
TCCTTATACA 
TTTACCGTGA 
GACTTGGTTG 
AGAAGCAGGT 
AACCTCAAGC 
ACTGGAGATT 
GGCCTTATGG 



TTATTTCTTC 
CCAATGCCAT 



TTTTAGAATT 
GTTCAAGTTG 
ATGTTATCTG 
AAATTCTGAC 
CCAGCTCTTT 
TCTTGTAGAA 
CCCAGTTGCA 
CCTGGAGTTC 



TGTTCCAAAT 
GTCTTCCAGA 
GCAGAGACAG 
AGTGCAGGTA 
GACAGTTTCC 
TGGCAGAGGC 
GTCCTCAGCT 
ATCTTCTAAC 
AATGCTGCCC 
ACTGTTACTT 
GTTATAAATT 
GATCCAGCTG 
TCTCCTTGTT 
TGTCTGTTTG 
GAGCACTTAC 
CCCTGGCGCA 
TTCTTTTAGC 
CTTCTTCCAG 
AAGACATTCT 
GCAGATGTCG 
TATTGACATC 
TTCAGTGTTC 
CTTAAGATAC 



CTTGCAGTGT 
TCACCCACCA 
CCAGTCTGTA 



CCJCCAACAG 
TTAGTAACCA 
TCCAGTAGTG 
CCCGAAGTAA 
TTCCTCTTCA 
TTTCGACAAA 
CAATCTCCTT 
TCCAACTGGT 
CTCTTCAAGC 
TCTCAGGTAA 
AGCTTCTTTT 
AAGTACTGCT 
GGGGCAACTC 
TGCTGCTCAT 
CCACAAAACA 
TTTGTTCTTC 
TGCCACCGCA 
TGTTTTTGAA 



CCAGTTTTCA GGATTTTGTG TCTTTTTGAA 



GCCATTGATT 
ACTGATCTGT 



AAATACCTTC 
CGAATCGCCC 



ATATCATAAT 
TTGTCGTTCC 



TGACAACAGC 
CATTTGTATT 
AAACTGTTCA 
GAAAGTGTCG 
TTGTACATTC 



TGCCTTCTGT 
TCACTCTCTG 
AGTTCTGTCC 
CAAAGAAGAT 
CAGATTGTGT 
CTCAGTCCAG 
ATGGTTTACA 
CTGGCTGAGT 
AGCCTTTCCA 
TATGTCAAAC 
TTCTAATAGG 
TGCCTAAAAT 
AGCTCTGGAG 
CAAGTTTATC 
CCTCCTGTTT 
TTCTGCCAGT 
CTCCAAGTGG 
AATTCATTTA 
AATCCTCTTT 
GACTCAAGCT 
GACTGTTGAA 
TTGACGCTGC 
TAGCATGTTC 
ACTTCATTCA 



CCATTTTTCA 



TATGAAGTTT 



6950 
7000 
7050 
7100 
7150 
7200 
7250 
7300 
7350 
7400 
7450 
7500 
7550 
7600 
7650 
7700 
7750 
7800 
7850 
7900 
7950 
8000 
8050 
8100 
8150 
8200 
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TTCCCCCTGG 


AAATCCATCT GTGCCACGGC 


TTCCTGTACT 


TTCACCTTTT 


8250 


CCATGGAGGT 


GGCACTTTGC AAGGCTGCTG 


TCTTCTTCTT 


GTGAATAATA 


8300 


TCAATCCGAC 


CTGAGATTTG TTGCAAATTG 


TCTTTTATAT 


TCTTAAGAGA 


8350 


CTCCTCTTGC 


TTAAAAAGAT CTTCAAAATC 


IVTAGCACAG 


AGTTCAGGAG 


8400 


TATTTAGAAG 


ATGATCAACT TCTGAAAGAG 


CTTGTAAGAT 


ATGACTGATC 


8450 


TCGGTCAAAT 


AAGTAGAAGG CACATAAGAA 


ACATCCAAAG 


GCATATCTTC 


8500 


AGTCGTCACT 


ACCATAGTTT CTTCATGGAG 



AGTGTGAATT 


TGTGCAAAGT 


8550 


TGAGTCTTCG 


AAACTGAGCA AAATTGCTCT 


CAATTTGCCG 


CCAGCGCTTG 


8600 


CTGAGCTGGA 


TCTGAGTTGG CTCCACTGCC 


ATTGCGGCCC 


CATTCTCAGA 


8650 


CAAGCCCTCA 


GCTTGCCTGC GCACTGCATT 


CAGCTCCTCT 


TTCTTCTTCT 


8700 


GCAATTCACG 


ATCAATTTCC TTTAATTTTC 


TTTCATCTCT 


GGGTTCAGGT 


8750 


AGGCTGGCTA 


ATTTTTTTTC AATTTCATCC AAGCATTTCA GGAGATCATC 


8800 


AGCCTGCCTC 


TTGTACTGAT ACCACTGGTG AGAAATTTCT AGGGC^riTT 


8850 


TTCTTCTTTG 


AGACCTCAAA TCCTTGAGAG 


CATTATGTTT 


TGTCTGTAAC 


8900 


AGCTGCTGTT TTATCTTTAT TTCCTCTCGC TTTCTCTCAT 


CTGTGATTCT 


8950 


TTGTTGTAAG 


TTGTCTCCTC TTTGCAACAA TTCATTTACA GTACCCTCAT 


9000 


TGTCTTCACT 


CATATCTTTA TTGAAGTCTT CCTCTTTCAG ATTCACCCCC 


9050 


TGCTGAATTT 


CAGCCTCCAG TGGTTCAAGC AATTTTTGTA TATCTGAGTT 


9100 


AAACTGCTCC 


AATTCCTTCA AAGGAATGGA 


GGCCTTTCCA 


GTCTTAATTC 


9150 


TGTGAGAAAT AGCTGCAAAT CGACGGTTGA GCTCAGAGAT TTGGGGCTCT 


9200 


ACTACTTTCC 


TGCAGTGGTC ACCGCGGTTT GCCATCAATT TTGCTGCTTG 


9250 


GTCACGTGTG 


GAGTCCACCT TTGGGCGCAT 


GTCATTCATT 


TCAGCCTTTA 


9300 


AACGCTTAAG 


AATGTCTTCC TTTTGTTGTG 


GTTTCrrCTT 


TTCAGACTCA 


9350 


TCTAAAAGTT 


CATCTGCATG AATGATCCAC 


TTTGTGATTT 


GTTCTATGTT 


9400 


CTGATCAAAG 


GTTTCCATGT GTTTCTGGTA 


TTCCAACAAA 


AGATTTAGCC 


9450 


ATTCTTCTAC 


TCTGGAGGTG ACAGCTATCC 


AGTTACTGTT 


CAGAAGACTC 


9500 
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9550 
AGTTTATCTT CTACCAAGGT TTCTTTCTTG CCCAACACCA TTTTCAAAGA 
CTCTCCTAAT TCTGTAACAC TCTTCAAGTG AGCCTTCTGT TTCTCAATCT 9600 
CTTTTTGAGT AGCCTTTCCC CAGGCAACTT CAGAATCCAA ATTACTTGGC 9650 
ATTCCTTCAA CTGCTGATCT CTTOGTCAAT TC.GTATCTG TTGCTGCCAG 9700 
CCA3TCTGTT AAGACATTCA TTTCCTTTCT CATCTTACGG GACAACTTCA 9750 
AGCATTTCTC CAACTGTTGC TTTCTCTCTG TTACCTTCGC ACCCAACTCA 9800 
TTGTAATGCA ATTTCAAAGC TGTTACTCGT TCATCAAGCT CTTTGGGAIT 9850 
TTCTGTCTGC TTTTTCTGTA CAATTTGACG TCCGGTTTTA ATCACCATTT 9900 
CCACTTCAGA CTTGACTTCA CTCAGGCTTT TATACAAGTT CACACAATGA 9950 
CTTAGTTGTG ACTGAATTAC TTCCTGTTCA ACACICTTGG TTTCCAATGC 10000 
AGGCAAATGC ATCTTGACTT CATCTAAAAT CATCTTACTT TCCTCTAGAC 10050 
GTTGTTCAAA ATTGGCTGGT TTTTGGAATA ATCGAAATTT CATGGAGACA 10100 
TCTTGTAATT TTTTCTGTGC AACATCAATT TGTGAAAGAA CCCTTTGGTT 10150 
GGCATCCTTC CCCTGGTTAT GTTOCTTCAT TTCTTCTAAA CTTATCTCAT 10200 
GACTTGTCAA ATCTGATTGG ATTTTCTGGG CTTCCTGAGG CATTTGAGCT 10250 
GCATCCACCT TGTCAGTGAT ATAAGCTGCC AACTGCTTGT CAATGAATTC 10300 
AAGCGACTCC TGAATTAAGT GCAAGGACTT TTCAATTTCC TGGGCAGACT 10350 
GGATAOTCTG TTCAAGCAAC TTTTGTTTCC TCACAGCCTC TTCATGTAGT 10400 
TCCCTCCAAC GAGAATTAAA CGTCTCAAGC TCCTCA3TGA TCAGTTCATC 10450 
CATGACTCCT CCATCTGTAA GAGTCTGTGC CAATAGACGA ATCTGATTTG 10500 
GGTTCTCCTC TGAATGATGC ATCAGATTTT CAAGAGATTC TAGCACTTCA 10550 
GTGATTTCCT CAGGTCCTGC AGGAACATTT TCCATGGTTT TAAGTTTCAA 10600 
TTCTACTTCA TTGAGCCACT TGTTTGCTTT CTCTAAATAT GACAATAACT 10650 
CA3GCCAACA TGCCCAAACT TCTTCCAAAG TTTTGCATTT TCCATTCAGC 10700 
CTGGTGCACA GCCATTGGTA GTTGGTGGTC AGAGTTTCAA GTTCCTTTTT 10750 
TAAGGCCTCT TGTGCTGAGG GTGGAGCGTG AGCTATTACA CTATTTACAG 10800 
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.91 

TCTCAGTAAG GAGTTTCACT TTAGTTTCTT 
GCTCTCTTCA TTTCTTCAAC AGCAGTCTGT 
TTCAAAATCT CTCTCTAGAT ATTCTTCTTC 
GCATCTCTGA TAGATCTTTT TGGAGGCTTA 
TTTAAGGCTT CCTTTCTGGT GTAGACCTGG 
AGTGTTAAGC TCTCTAAGTT CTGTCTCCAG 
CAGCTTCACT CTTTATCTTC TGCCCACCTT 
GGCTGAATTG TTTGAATATC ACCAACTAAA 
TTTTTTCAGG ATTTCAGCAT CCCCCAGGGC 
AAACATCAAC TTCAGCCATC CATTTCTGTA 
AATTTTCGAA GTTTATTCAT ATGTTCTTCT 
CAACTGGGAG GAAAGTTTCT TCCAGTGCCC 
ACAGATATTT CTGGCATATT TCTGAAGGTG 
ACAGTGTCAC TCAGATAGTT GAAGCCATTT 
TTGCAGAGCC TGTAATTTCC CGAGTCTCTC 
TAACACTAAG ATAAGGTACA GAGAGTTTGC 
GTCCTGATGC TACTCATTGT CTCCTGATAG 
AAAAATTGTC TGTAGCTCTT TCTCTTTGGC 
GGTTAAAATG ATTAGTAAAG GCCACAAAGT 
CCCTGTCCCT TTTCTTTCAG TTGTAGACTC 
TTGAGGCTGA AGAGCTGACA ATCTGTTGAC 
ACTGGCTTTT AATTGCTGTT GGCTCTGATA 
AACAAGTTTT CGGCAGTAGT TGTCATCTGT 
ATAAAAGGTA ATGATGTTGG TTTGATACTC 
TCAGCAATTG GCAGAATTCT GTCCACCGGC 
TGTCTGATAC TTTCAGCATT AACACCCTCA 
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TTTGTAGTGC 


CTCTTCTTTA 


10850 


AATTCATCTG 


GAGTTTTATA 


10900 


AGCTTGTGTC 


ATCCACTCAT 


10950 


CGJTTTTATC 


CAAACCTGCC 


11000 


CGGCATATGT 


GATCCCACTG 


11050 


TCTGGATGCA 


AACTCAAGTT 


11100 


CATTAACACT 


ATTTAAACTG 


11150 


AGTCTGCATT 


GTTTGAGCTG 


11200 


AGGCCATTCC 


TCTTTCAGGA 


11250 


AGGTTTTTAT 


GTGATTCTGA 


11300 


AGCTTTTGGC 


AGCTTTCCAC 


11350 


CTCAATCTCT 


TCAAATTCTG 


11400 


CTTTCTTGGC 


CATCTCCTTC 


11450 


TGTTGCTCTT 


TCAAAGAACT 


11500 


CTCCATTATT 


TCATATTCAG 


11550 


TTTCTGACTG 


CTGGATCCAC 


11600 


CGCATTGGTG 


GTAAAGTGTC 


11650 


CCTCACACCA 


TCAAAGATGT 


11700 


CTGCATCCAG 


AAACATTGGC 


11750 


TGAATTTTTA 


ATTGCTCAAT 


11800 


TTCATCCTTA 


CAAATTTTTA 


11850 


GGGTGGTAGA 


CTGGGTTTTC 


11900 


TCCAATTGTT 


GTAGCTGATT 


11950 


TAGCCAGTTA 


ACTCTCTCAC 


12000 


TGTTCAGTTG 


TTCTGAAGCT 


12050 


TTTGCCATCT 


GTTCCACCAG 


12100 
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GGCCTGAGCT GATCTGCTGC CATCTTGCAG TTTTCTGAAC TTCTCTGCTT 12150 
TTTCTCGTGC TATGGCATTG ACTTTTTCTT GCAAGTCTGA GATGTTGCCT 12200 
TCTTTTCGAT AGACTGCAAA TTCAGAACTC TGTAATACAG CTTCTGAACG 12250 
AGTAATCCAA CTGTGAAGTT CAGTTATATC GAOATCCAAC C1TTTCCTGA 12300 
GTTCAGAATC CACAGTTATC TGCCTCTTCT TTTGAGGAGG TGGTGGTGGA 12350 
AGTTCCTCTT GGGCATGTTT TACCATGATT TGTTCCCPTG TGGTCACCAT 12400 
AGTTACCGTT TCCATTACAG TTGTCTGTGT TAGGGATGGT TGAGTGGTGG 12450 
TGACAGCCTG TGAAATTTGT GCTGAACTCT TTTCAAGTTT TTGGGTTAAA 12500 
TTGTCCCAAC GTTGTGCAAA GTTTTCCATC CAGATTTCCA TCTTTTGAGT 12550 
CACTGACTTA TTTTTCAGTG CCGAAAGTAG ATCTTGATTG AGTGAACTTA 12600 
^^TCCAT GGTTGGCTTT TTCTTTTCTA GATCTATTTT TAAAGTAGAT 12650 
ATTTTGTGAA GACTTGACAT CATTTCATTT TGATCHTAA AGCCACTTGT 12700 
CTGAATGTTC TTCATTGCAT CTTCTTTTTC TGAAAGCCAT GTACTAAAAA 12750 
OGCACTGTTC TTGAGTAAAA TGCIGCCATI TTAGAAGAAI ATCTTGTAAA 12800 
ACAATCCAGC GGTCTTCAGT CCATCTGCAG ATATTTGCCC ATCGATCTCC 12850 
CAGTACCTTA AGTTGTTCTT CCAAAGCAGC TGTTGCATGA TCACCGCTGG 12900 
ATTCATCAAC CACTACTACC ATGTGAGTGA GCGAGTTGAC CCTGACCTGC 12950 
TCCTGTTCTA GATCTTCTTG AAGCACCTTA TOTTGTTGTA CTTGGCATTT 13000 
TAGATCTTCA AGATCAGGTC CAAAGGGCTC TTCCTCCATT TTCTTAGTTC 13050 
TCTCTTCAGT ITTTGTTAAC CAGTCATCTA GTTCTTTTAA TTTCTGATTC 13100 
TGGAGATCCA TTAGAACTTT GTGTAATTTG cmGTTTTT CCATGCTAGC 13150 
TACCCTGAGA CATTCCCATC TTGAATTTAG GAGATTCATT TGTTCTTGCA 13200 
CTTCAGCTTC TTCAICTTCT GATAATTTCC CTTTTCCAAC TAGTTGACTT 13250 
CCTAACTGTA GAACATTACC AACAAGTCCT TGATGAGATG TCAGATCCAT 13300 
CATGAATCCC TCATGAGCAT GAAACTGTTC TTTCACTTCT TCAACATCAT 13350 
TTGAAATCTC TCCTTGTGCT CGCAATGTAT CCTCGGCAGA AAGAAGCCAT 13400 
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GAAAGTACTT CTTCTAAAGC AGTTTGGTAA 


CTCCATCAAT GAACTGTCAA GTGACTTGTC 
GTGAAGGATA GGGGCTCTGT GTGGAATCAG 
TGTGTGAAGG CATAACTCTT GAATCGAGGC 
TTCATAGCCC TGTGCTAGAC TGACTGTGAT 
GGTGATGTAA TTGAAAATGT TCTTCTCTAG 
GGCAACATTT CCACTTCTTG AATGGCTTCA 
AACTTGAAAG AGTGATGTGA TGTACATTAA 
AAGTGGTAGC AACATCTTCA GGATCAAGAA 
CATTTTGCAA TGTTGAAGGC ATGTTCCAGT 
TGAAACCACA CTATTCCAAT CAAACAGGTC 
GAGCATTCAA AGCCAACCCG TCGGACCAGC 
TTAACCTGTG GATAATTACG TGTTGACTGT 
CTTTTCACTG TTGGTTTGCT GCAATCCAGC 
TTTTGACCTG CCAGTGGAGG ATTATATTCC 
TTATGATTTC CATCCACTAT GTCAGTGCTT 
ATTATTTTTC TGTAAGACCC GCAGTGCCTT 
GAACTCTTGT AGATCCCTTT TCTTTTGGCA 
TCCAAGAGGT CTAGGAGGCG TTTTCCATCC 
GTCTATGTGT TGCTTTCCAA ACTTAGAAAA 
TGAATGTTTT CTTTTGAACA TCTTCTCTTT 
TCCCACCAAA GCATTTGGAA GAAAAAGTAT 
ATCTTGGTAA AAGTTTCTCC CAGTTTTATT 
GATGAGAAGC CAATAAACTT CAGCAGCCTT 
TAGCACTTCA AGTCTTCCTA TTCGTTTTTT 
AGAGCGGAAT TCCTGCAGCC CGGGGGATCC 
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CTATCCAGAT 


TTACTTCCGT 


13450 


TCTGGGAGCT 


TCCAAATGCT 


13500 


AGGTGGCAAC 


ATAAGCAGCC 


13550 


TxAGGAGATG 


AAGAAGTTTG 


13600 


CTGTTGAGAG 


TAATGCATCT 


13650 


TTACTTTTGA 


AGATGTCCTG 


13700 


ATGCTCACTT 


GTTGTGGCAA 


13750 


GATGGACTTC 


TTGTCTGGAT 


13800 


GTTTTTCTAT 


GCCTAACTGG 


13850 


CTTTGGGTGG 


CTGAGTGCTG 


13900 


GGGCCTGTGA 


CTATGGATAA 


13950 


TAGAGGTGAA 


GTTGATGACG 


14000 


CGAACCCAGC 


TCAGAAGAAT 


14050 


CATGATAGTT 


TTCATCACAT 


14100 


AAATCAAACC 


AAGAGTGAGT 


14150 


CCTATATTCA 


CTAAATCAAC 


14200 


GTTGACATTG 


TTCAGGGCAT 


14250 


6TTTTTGCCC 


TGTAAGGCCT 


14300 


TGCAGGTGAC 


TGAAGAGGTT 


14350 


TTGTGCATTT 


ATCCATTTTG 


14400 


CATAACAGTC 


CTCTACTTCT 


14450 


ATATCAAGGC 


AGGGATAAAA 


14500 


GCTCCAGGAG 


GCTTAGGTAC 


14550 


GACAAAAAAA 


AAAAAAAAAA 


14600 


CTATAAAGCT 


ATTGCCTTCA 


14650 


ACTAGTTCTA 


GAGCGGCCGC 


14700 






GGCTACAATT CCGCAGCTTT TAGAGCAGAA GTAACACTTC CGTACAGOCC 14750 
TAGAAGTAAA GGCAACATCC ACTGAGGAGC AGTTCTTTGA TTTGCACCAC 14800 
CACCGGATCC GGGACCTGAA ATAAAAGACA AAAAGACTAA ACTTACCAGT 14850 
TAACTTTCTG GTTTTTCAGT TCCTCGAOTA CCGATCCTC TAGAGTCCGG 14900 
AGGCTGGATC GGTCCCGGTG TCTTCTATGG AGGTCAAAAC AGCGTGGATG 14950 
GCGTCTCCAG GCGATCTGAC GGTTCACTAA ACGAGCTCTG CTTATATAGA 15000 
CCTCCCACCG TACACGCCTA CCGCCCATTT GCGTCAATGG GGCGGAGTTG 15050 
TTACGACATT TTGGAAAGTC CCGTTGATTT TGGTGCCAAA ACAAACTCCC 15100 
ATTGACGTCA ATGGGGTGGA GACTTGGAAA TCCCCGTGAG TCAAACCGCT 15150 
ATCCACGCCC ATTGATGTAC TGCCAAAACC GCATCACCAT GGTAATAGCG 15200 
ATGACTAATA CGTAGATGTA CTGCCAAGTA GGAAAGTCCC ATAAGGTCAT 15250 
GTACTGGGCA TAATGCCAGG CGGGCCATTT ACCGTCATTG ACGTCAATAG 15300 
GGGGCGTACT TGGCATATGA TACACTTGAT GTACTGCCAA GTGGGCAGTT 15350 
TACCGTAAAT ACTCCACCCA TTGACGTCAA TGGAAAGTCC CTATTGGCGT 15400 
TACTATGGGA ACATACGTCA TTATTGACGT CAATGGGCGG GGGTCGTTGC 15450 
CCGGTCAGCC AGGCGGGCCA TTTACCGTAA GTTATGTAAC GACCTGCAGG 15500 
TCGACTCTAG AGGATCTCCC TAGACAAATA TTACGCGCTA TGAGTAACAC 15550 
AAAATTATTC AGATTTCACT TCCTCTTATT CAGTTTTCCC GCGAAAATGG 15600 
CCAAATCTTA CTCGGTTACG CCCAAATTTA CTACAACATC CGCCTAAAAC 15650 
CGCGCGAAAA TTGTCACTTC CTGTGTACAC CGGCGCACAC CAAAAACGTC 15700 
ACTTTTGCCA CATCCGTCGC TTACATGTGT TCCGCCACAC TTGCAACATC 15750 
ACACTTCCGC CACACTACTA CGTCACCCGC CCCGTTCCCA CGCCCCGCGC 15800 
CACGTCACAA ACTCCACCCC CTCATTATCA TATTGGCTTC AATCCAAAAT 15850 
AAGGTATATT ATTGATGATG CTAGCGGGGC CCTATATATG GATCCAATTG 15900 
CAATGATCAT CATGACAGAT CTGCGCGCGA TCGATATCAG CGCTTTAAAT 15950 
TTGCGCATGC TAGCTATAGT TCTAGAGGTA CCGGTTGTTA ACGTIACCCC 16000 
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GCTACGTATA CTCCGGAATA TTAATAGGCC 
GCCGCCTGCA GCTGGCGCCA TCGATACGCG 
GTACAGAGCT CGAGAAGTAC TAGTGGCCAC 
TTGGCACTGG CCGTCGTTTT ACAACGTCGT 
TACCCAACTT AATCGCCTTG CAGCACATCC 
ATAGCGAAGA GGCCCGCACC GATCGCCCTT 
AATGGCGAAT GGCGCCTGAT GCGGTATTTT 
TATTTCACAC CGCATACGTC AAAGCAACCA 
CGCATTAAGC GCGGCGGGTG TGGTGGTTAC 
TTGCCAGCGC CCTAGCGCCC GCTCCTTTCG 
GCCACGTTCG CCGGCTTTCC CCGTCAAGCT 
AGGGTTCCGA TTTAGTGCTT TACGGCACCT 
TGGGTGATGG TTCACGTAGT GGGCCATCGC 
CCTTTGACGT TGGAGTCCAC GTTCTTTAAT 
TGGAACAACA CTCAACCCTA TCTCGGGCTA 
TTTTGCCGAT TTCGGCCTAT TGGTTAAAAA 
TTTAACGCGA ATTTTAACAA AATATTAACG 
TCTCAGTACA ATCTGCTCTG ATGCCGCATA 
CGCCAACACC CGCTGACGCG CCCTGACGGG 
GCTTACAGAC AAGCTGTGAC CGTCTCCGGG 
TTCACCGTCA TCACCGAAAC GCGCGAGACG 
TATTTTTATA GGTTAATGTC ATGATAATAA 
GGCACTTTTC GGGGAAATGT GCGCGGAACC 
AATACATTCA AATATGTATC CGCTCATGAG 
TTCAATAATA TTGAAAAAGG AAGAGTATGA 
GCCCTTATTC CCTTTTTTGC GGCATTTTGC 
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TAGGATGCAT 


ATGGCGGCCG 


16050 


TACGTCGCGA 


CCGCGGACAT 


16100 


GTGGGCCGTG 


CACCTTAAGC 


16150 


GACTGGGAAA 


ACCCTGGCGT 


16200 


CCCTTTCGCC 


AGCTGGCGTA 


16250 


CCCAACAGTT 


GCGCAGCGTG 


16300 


CTCCTTACGC ATCTGTGCGG 


16350 


TAGTACGCGC 


CCTGTAGCGG 


16400 


GCGCAGCGTG 


ACCGCTACAC 


16450 


CTTTCTTCCC 


TTCCTTTCTC 


16500 


CTAAATCGGG 


GGCTCCCTTT 


16550 


CGACCCCAAA 


AAACTTGATT 


16600 


CCTGATAGAC 


GGTTTTTCGC 


16650 


AGTGGACTCT 


TGTTCCAAAC 


16700 


TTCTTTTGAT 


TTATAAGGGA 


16750 


ATGAGCTGAT 


TTAACAAAAA 


16800 


TTTACAATTT 


TATGGTGCAC 


16850 


GTTAAGCCAG 


CCCCGACACC 


16900 


CTTGTCTGCT 


CCCGGCATCC 


16950 


AGCTGCATGT 


GTCAGAGGTT 


17000 


AAAGGGCCTC 


GTGATACGCC 


17050 


TGGTTTCTTA 


GACGTCAGGT 


17100 


CCTATTTGTT 


TATTTTTCTA 


17150 


ACAATAACCC 


TGATAAATGC 


17200 


GTATTCAACA 


TTTCCGTGTC 


17250 


CTTCCTGTTT 


TTGCTCACCC 


17300 
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AGAAACGCTG GTGAAAGTAA AAGATGCTGA AGATCAGTTG GGTGCACGAG 17350 
TGGGTTACAT CGAAC1GGAT CTCAACAGCG GTAAGATCCT TGAGAGTTTT 17400 
CGCCCCGAAG AACGTTTTCC AATGATGAGC ACTTTTAAAG TTCTGCTATG 17450 
TGGCGCGGTA TTATCCCGTA TTGACGCCGG GCAAGAGCAA CTCGGTCGCC 17500 
GCATACACTA TTCTCAGAAT GACTTGGTTG AGTACTCACC AGTCACAGAA 17550 
AAGCATCTTA CGGATGGCAT GACAGTAAGA GAATTATGCA GTGCTGCCAT 17600 
AACCATGAGT GATAACACTG GGGCCAACTT ACTTCTGACA ACGATCGGAG 17650 
GACCGAAGGA GCTAACCGCT TTTTTGCACA ACATGGGGGA TCATGTAACT 17700 
CGCCTTGATC GTTGGGAACC GGAGCTGAAT GAAGCCATAC CAAACGACGA 17750 
GCGTGACACC ACGATGCCTG TAGCAATGGC AACAACGTTG CGCAAACTAT 17800
TAACTGGCGA ACTACTTACT CTAGCTTCCC GGCAACAATT AATAGACTGG 17850 
ATGGAGGCGG ATAAAGTTGC AGGACCACTT CTGCGCTCGG CCCTTCCGGC 17900 
TGGCTGGTTT ATTGCTGATA AATCTGGAGC CGGTGAGCGT GGGTCTCGCG 17950 
GTATCATTGC AGCACTGGGG CCAGATGGTA AGCCCTCCCG TATCGTAGTT 18000
ATCTACACGA CGGGGAGTCA GGCAACTATG GATGAACGAA ATAGACAGAT 18050
CGCTGAGATA GGTGCCTCAC TGATTAAGCA TTGGTAACTG TCAGACCAAG 18100
TTTACTCATA TATACTTTAG ATTGATTTAA AACTTCATTT TTAATTTAAA 18150
AGGATCTAGG TGAAGATCCT TTTTGATAAT CTCATGACCA AAATCCCTTA 18200
ACGTGAGTTT TCGTTCCACT GAGCGTCAGA CCCCGTAGAA AAGATCAAAG 18250
GATCTTCTTG AGATCCTTTT TTTCTGCGCG TAATCTGCTG CTTGCAAACA 18300
AAAAAACCAC CGCTACCAGC GGTGGTTTGT TTGCCGGATC AAGAGCTACC 18350
AACTCTTTTT CCGAAGGTAA CTGGCTTCAG CAGAGCGCAG ATACCAAATA 18400 
CTGTTCTTCT AGTGTAGCCG TAGTTAGGCC ACCACTTCAA GAACTCTGTA 18450
CCACCGCCTA CATACCTCGC TCTGCTAATC CTGTTACCAG TGGCTGCTGC 18500 
CAGTGGCGAT AAGTCGTGTC TTACCGGGTT GGACTCAAGA CGATAGTTAC 18550 
CGGATAAGGC GCAGCGGTCG GGCTGAACGG GGGGTTCGTG CACACAGCCC 18600
AGCTTGGAGC GAACGACCTA CACCGAACTG
ATGAGAAAGC GCCACGCTTC CCGAAGGGAG 
TAAGCGGCAG GGTCGGAACA GGAGAGCGCA 
AACGCCTGGT ATCTTTATAG TCCTGTCGGG 
GCGTCGATTT TTGTGATGCT CGTCAGGGGG 
CCAGCAACGC GGCCTTTTTA CGGTTCCTGG 
CACATGTTCT TTCCTGCGTT ATCCCCTGAT 
CGCCTTTGAG TGAGCTGATA CCGCTCGCCG 
GCGAGTCAGT GAGCGAGGAA GCGGAAGAGC 
CTCCCCGCGC GTTGGCCGAT TCATTAATGC 
CGACTGGAAA GCGGGCAGTG AGCGCAACGC
CTCATTAGGC ACCCCAGGCT TTACACTTTA 
TGTGGAATTG TGAGCGGATA ACAATTTCAC 
TGATTACGAA TTCGAATGGC CATGGGACGT 
CCGGGCC
WHAT IS CLAIMED IS:



1. A recombinant shuttle vector comprising:

(a) the DNA sequences of, or corresponding to, 
a portion of the genome of an adenovirus which comprises
DNA sequences of, or corresponding to, the adenovirus 5'
and 3' inverted terminal repeats and packaging/enhancer
domain necessary for replication and virion encapsidation 
in the absence of sequence encoding viral genes; 

(b) a selected gene operatively linked to 
regulatory sequences directing its expression, said gene
operatively linked to the DNA of (a) and capable of 
expression in a target cell in vivo or in vitro. 



2. The vector according to claim 1 wherein said
DNA sequences (a) comprise the native adenovirus 5'
inverted terminal repeats and packaging sequences. 

3. The vector according to claim 1 wherein said
DNA sequences (a) comprise the native adenovirus 3'
inverted terminal repeat sequences.

4. The vector according to claim 1 wherein said
selected gene (b) is a reporter gene. 

5. The vector according to claim 4 wherein said
reporter gene is selected from the group consisting of
the genes encoding β-galactosidase, alkaline phosphatase
and green fluorescent protein.

6. The vector according to claim 1 wherein said 
selected gene (b) is a therapeutic gene.

7. The vector according to claim 6 wherein said
therapeutic gene is selected from the group consisting of 
a normal CFTR gene, a DMD Becker allele and a normal LDL 
gene. 

8. A crippled adenovirus helper virus comprising a 
modified adenovirus sequence in place of native 
adenovirus sequence map units 0-1, which modification 
reduces the packaging efficiency of said virus, said 
virus also containing selected adenovirus genes necessary 
to direct a productive viral infection. 

9. The helper virus according to claim 8 wherein 
said modified sequence comprises: 

i. a fragment of adenovirus map units 0-1; 

ii. a fragment of (i) containing a 5' inverted
terminal repeat and between one to four selected 
packaging sequences, 

iii. a modified fragment of (i) containing at 
least one PAC consensus sequence in place of at least one 
native PAC sequence; and 

iv. a modified fragment of (ii), wherein said
native PAC sequences are mutated to contain modified 
sequences. 

10. The virus according to claim 8 wherein said 
modified sequence comprises Ad5 base pairs 1-269. 


11. The virus according to claim 8 wherein said 
sequence (ii) comprises Ad5 base pairs 1-321. 

12. The virus according to claim 8 wherein said 
helper adenovirus is conjugated to a poly-cation 
sequence.

13. A method for producing a recombinant adenovirus
which comprises transfecting a selected host cell with

(a) a recombinant shuttle vector comprising
i. the DNA sequences of, or
corresponding to, a portion of the genome of an
adenovirus which comprises adenovirus 5' and 3'
elements necessary for replication and virion
encapsidation in the absence of sequence encoding viral
genes; and
ii. a selected gene operatively linked to
regulatory sequences directing its expression, said gene
linked to the DNA of (a) and capable of expression in a
target cell in vivo or in vitro; and

(b) a helper adenovirus comprising sufficient
adenovirus gene sequences necessary for a productive
viral infection, wherein said transfected host cell
permits the formation of a recombinant virus comprising
the DNA of (i) and (ii) in an adenoviral capsid, and

isolating and purifying the recombinant virus from
said cell.

14. The method according to claim 13, wherein said
helper virus is a crippled helper virus comprising a
modified adenovirus sequence in place of native
adenovirus sequence map units 0-1, which modification
reduces the packaging efficiency of said helper virus,
said helper virus also containing selected adenovirus
genes necessary to direct a productive viral infection.

15. The method according to claim 13 wherein said
helper adenovirus is associated with a poly-cation
sequence.

16. The method according to claim 13 wherein said
vector is associated with said helper adenovirus 
conjugate in a single particle. 

17. The method according to claim 13 wherein said
helper virus is an adenovirus sequence containing
deletions of all or portions of the E1a and E1b genes.

18. The method according to claim 13 wherein said 
helper virus is an adenovirus sequence containing 
deletions of all or a portion of the E3 gene. 

19. A recombinant adenovirus comprising 

i. the DNA of, or corresponding to, a
portion of the genome of an adenovirus which comprises
adenovirus 5' and 3' cis-elements necessary for
replication and virion encapsidation in the absence of 
sequence encoding viral genes; 

ii. a selected gene operatively linked to 
regulatory sequences directing its expression, said gene 
linked to the DNA of (a) and capable of expression in a 
target cell in vivo or in vitro; 

said DNA and gene encapsidated in an adenoviral 
capsid. 

20. The virus according to claim 19 wherein said 
viral capsid is a capsid of an adenovirus serotype 
selected from the group consisting of types 2, 4, 5, 7, 
12 and 40. 

21. The virus according to claim 19 wherein said 
selected gene is a CFTR gene, a DMD gene and an LDL gene.

22. The use of a recombinant adenovirus according
to claim 19 for the manufacture of a pharmaceutical 
composition suitable for delivering and integrating a 
selected gene into the chromosome of a target cell.

FIGURE 3A

FIGURE 3B



TGGTGTGGGC CATAATTCAA TTCGCGCGTC CCGCAGCGCA GACCGTTTTC 
GCTCGGGAAG ACGTACGGGG TATACATGTC TGACAATGGC AGATCCCAGC 
GGTCAAAACA GGCGGCAGTA AGGCGGTCGG GATAGTTTTC TTGCGGCCCT 
AATCCGAGCC AGTTTACCCG CTCTGCTACC TGCGCCAGCT GGCAGTTCAG 
GCCAATCCGC GCCGGATGCG GTGTATCGCT CGCCACTTCA ACATCAACGG 
TAATCGCCAT TTGACCACTA CCATCAATCC GGTAGGTTTT CCGGCTGATA 
AATAAGGTTT TCCCCTGATG CTGCCACGCG TGAGCGGTCG TAATCAGCAC 
CGCATCAGCA AGTGTATCTG CCGTGCACTG CAACAACGCT GCTTCGGCCT 
GGTAATGGCC CGCCGCCTTC CAGCGTTCGA CCCAGGCGTT AGGGTCAATG 
CGGGTCGCTT CACTTACGCC AATGTCGTTA TCCAGCGGTG CACGGGTGAA 
CTGATCGCGC AGCGGCGTCA GCAGTTGTTT TTTATCGCCA ATCCACATCT 
GTGAAAGAAA GCCTGACTGG CGGTTAAATT GCCAACGCTT ATTACCCAGC 
TCGATGCAAA AATCCATTTC GCTGGTGGTC AGATGCGGGA TGGCGTGGGA 
CGCGGCGGGG AGCGTCACAC TGAGGTTTTC CGCCAGACGC CACTGCTGCC 
AGGCGCTGAT GTGCCCGGCT TCTGACCATG CGGTCGCGTT CGGTTGCACT 
ACGCGTACTG TGAGCCAGAG TTGCCCGGCG CTCTCCGGCT GCGGTAGTTC 
AGGCAGTTCA ATCAACTGTT TACCTTGTGG AGCGACATCC AGAGGCACTT 
CACCGCTTGC CAGCGGCTTA CCATCCAGCG CCACCATCCA GTGCAGGAGC 
TCGTTATCGC TATGACGGAA CAGGTATTCG CTGGTCACTT CGATGGTTTG 
CCCGGATAAA CGGAACTGGA AAAACTGCTG CTGGTGTTTT GCTTCCGTCA 
GCGCTGGATG CGGCGTGCGG TCGGCAAAGA CCAGACCGTT CATACAGAAC 
TGGCGATCGT TCGGCGTATC GCCAAAATCA CCGCCGTAAG CCGACCACGG 
GTTGCCGTTT TCATCATATT TAATCAGCGA CTGATCCACC CAGTCCCAGA 
CGAAGCCGCC CTGTAAACGG GGATACTGAC GAAACGCCTG CCAGTATTTA 
GCGAAACCGC CAAGACTGTT ACCCATCGCG TGGGCGTATT CGCAAAGGAT 
CAGCGGGCGC GTCTCTCCAG GTAGCGAAAG CCATTTTTTG ATGGACCATT

FIGURE 3C

FIGURE 3D



AGTATCGGCC TCAGGAAGAT CGCACTCCAG CCAGCTTTCC GGCACCGCTT 
CTGGTGCCGG AAACCAGGCA AAGCGCCATT CGCCATTCAG GCTGCGCAAC 
TGTTGGGAAG GGCGATCGGT GCGGGCCTCT TCGCTATTAC GCCAGCTGGC
CAAAGGGGGA TGTGCTGCAA GGCGATTAAG TTGGGTAACG CCAGGGTTTT 
CCCAGTCACG ACGTTGTAAA ACGACGGGAT CGCGCTTGAG CAGCTCCTTG 
CTGGTGTCCA GACCAATGCC TCCCAGACCG GCAACGAAAA TCACGTTCTT 
GTTGGTCAAA GTAAACGACA TGGTGACTTC TTTTTTGCTT TAGCAGGCTC 
TTTCGATCCC CGGGAATTGC GGCCGCGGGT ACAATTCCGC AGCTTTTAGA 
GCAGAAGTAA CACTTCCGTA CAGGCCTAGA AGTAAAGGCA ACATCCACTG 
AGGAGCAGTT CTTTGATTTG CACCACCACC GGATCCGGGA CCTGAAATAA 
AAGACAAAAA GACTAAACTT ACCAGTTAAC TTTCTGGTTT TTCAGTTCCT 
CGAGTACCGG ATCCTCTAGA GTCCGGAGGC TGGATCGGTC CCGGTCTCTT 
CTATGGAGGT CAAAACAGCG TGGATGGCGT CTCCAGGCGA TCTGACGGTT 
CACTAAACGA GCTCTGCTTA TATAGACCTC CCACCGTACA CGCCTACCGC 
CCATTTGCGT CAATGGGGCG GAGTTGTTAC GACATTTTGG AAAGTCCCGT 
TGATTTTGGT GCCAAAACAA ACTCCCATTG ACGTCAATGG GGTGGAGACT 
TGGAAATCCC CGTGAGTCAA ACCGCTATCC ACGCCCATTG ATGTACTGCC 
AAAACCGCAT CACCATGGTA ATAGCGATGA CTAATACGTA GATGTACTGC 
CAAGTAGGAA AGTCCCATAA GGTCATGTAC TGGGCATAAT GCCAGGCGGG 
CCATTTACCG TCATTGACGT CAATAGGGGG CGTACTTGGC ATATGATACA 
CTTGATGTAC TGCCAAGTGG GCAGTTTACC GTAAATACTC CACCCATTGA 
CGTCAATGGA AAGTCCCTAT TGGCGTTACT ATGGGAACAT ACGTCATTAT 
TGACGTCAAT GGGCGGGGGT CGTTGGGCGG TCAGCCAGGC GGGCCATTTA 
CCGTAAGTTA TGTAACGACC TGCAGGTCGA CTCTAGAGGA TCTCCCTAGA 
CAAATATTAC GCGCTATGAG TAACACAAAA TTATTCAGAT TTCACTTCCT 
CTTATTCAGT TTTCCCGCGA AAATGGCCAA ATCTTACTCG GTTACGCCCA

FIGURE 3F

AGTAAACTTG GTCTGACAGT TACCAATGCT TAATCAGTGA GGCACCTATC 
TCAGCGATCT GTCTATTTCG TTCATCCATA GTTGCCTGAC TCCCCGTCGT 
GTAGATAACT ACGATACGGG AGGGCTTACC ATCTGGCCCC AGTGCTGCAA 
TGATACCGCG AGACCCACGC TCACCGGCTC CAGATTTATC AGCAATAAAC
CAGCCAGCCG GAAGGGCCGA GCGCAGAAGT GGTCCTGCAA CTTTATCCGC 
CTCCATCCAG TCTATTAATT GTTGCCGGGA AGCTAGAGTA AGTAGTTCGC 
CAGTTAATAG TTTGCGCAAC GTTGTTGCCA TTGCTACAGG CATCGTGGTG 
TCACGCTCGT CGTTTGGTAT GGCTTCATTC AGCTCCGGTT CCCAACGATC 
AAGGCGAGTT ACATGATCCC CCATGTTGTG CAAAAAAGCG GTTAGCTCCT 
TCGGTCCTCC GATCGTTGTC AGAAGTAAGT TGGCCGCAGT GTTATCACTC 
ATGGTTATGG CAGCACTGCA TAATTCTCTT ACTGTCATGC CATCCGTAAG 
ATGCTTTTCT GTGACTGGTG AGTACTCAAC CAAGTCATTC TGAGAATAGT 
GTATGCGGCG ACCGAGTTGC TCTTGCCCGG CGTCAATACG GGATAATACC 
GCGCCACATA GCAGAACTTT AAAAGTGCTC ATCATTGGAA AACGTTCTTC 
GGGGCGAAAA CTCTCAAGGA TCTTACCGCT GTTGAGATCC AGTTCGATGT 
AACCCACTCG TGCACCCAAC TGATCTTCAG CATCTTTTAC TTTCACCAGC 
GTTTCTGGGT GAGCAAAAAC AGGAAGGCAA AATGCCGCAA AAAAGGGAAT 
AAGGGCGACA CGGAAATGTT GAATACTCAT ACTCTTCCTT TTTCAATATT 
ATTGAAGCAT TTATCAGGGT TATTGTCTCA TGAGCGGATA CATATTTGAA 
TGTATTTAGA AAAATAAACA AATAGGGGTT CCGCGCACAT TTCCCCGAAA 
AGTGCCACCT GACGTCTAAG AAACCATTAT TATCATGACA TTAACCTATA 
AAAATAGGCG TATCACGAGG CCCTTTCGTC TCGCGCGTTT CGGTGATGAC 
GGTGAAAACC TCTGACACAT GCAGCTCCCG GAGACGGTCA CAGCTTGTCT 
GTAAGCGGAT GCCGGGAGCA GACAAGCCCG TCAGGGCGCG TCAGCGGGTG 
TTGGCGGGTG TCGGGGCTGG CTTAACTATG CGGCATCAGA GCAGATTGTA 
CTGAGAGTGC ACCATATGGA CATATTGTCG TTAGAACGCG GCTACAATTA 
ATACATAACC TTATGTATCA TACACATACG ATTTAGGTGA CACTATA

FIGURE 5A



GAATTCGCTA GCTAGCGGGG GAATACATAC CCGCAGGCGT AGAGACAACA 
TTACAGCCCC CATAGGAGGT ATAACAAAAT TAATAGGAGA GAAAAACACA 
TAAACACCTG AAAAACCCTC CTGCCTAGGC AAAATAGCAC CCTCCCGCTC 
CAGAACAACA TACAGCGCTT CACAGCGGCA GCCTAACAGT CAGCCTTACC 
AGTAAAAAAG AAAACCTATT AAAAAAACAC CACTCGACAC GGCACCAGCT 
CAATCAGTCA CAGTGTAAAA AAGGGCCAAG TGCAGAGCGA GTATATATAG 
GACTAAAAAA TGACGTAACG GTTAAAGTCC ACAAAAAACA CCCAGAAAAC 
CGCACGCGAA CCTACGCCCA GAAACGAAAG CCAAAAAACC CACAACTTCC 
TCAAATCGTC ACTTCCGTTT TCCCACGTTA CGTAACTTCC CATTTTAAGA 
AAACTACAAT TCCCAACACA TACAAGTTAC TCCGCCCTAA AACCTACGTC 
ACCCGCCCCG TTCCCACGCC CCGCGCCACG TCACAAACTC CACCCCCTCA 
TTATCATATT GGCTTCAATC CAAAATAAGG TATATTATTG ATGATGCTAG 
CATCATCAAT AATATACCTT ATTTTGGATT GAAGCCAATA TGATAATGAG 
GGGGTGGAGT TTGTGACGTG GCGCGGGGCG TGGGAACGGG GCGGGTGACG 
TAGTAGTGTG GCGGAAGTGT GATGTTGCAA GTGTGGCGGA ACACATGTAA 
GCGACGGATG TGGCAAAAGT GACGTTTTTG GTGTGCGCCG GTGTACACAG 
GAAGTGACAA TTTTCGCGCG GTTTTAGGCG GATGTTGTAG TAAATTTGGG 
CGTAACCGAG TAAGATTTGG CCATTTTCGC GGGAAAACTG AATAAGAGGA 
AGTGAAATCT GAATAATTTT GTGTTACTCA TAGCGCGTAA TATTTGTCTA 
GGGAGATCAG CCTGCAGGTC GTTACATAAC TTACGGTAAA TGGCCCGCCT 
GGCTGACCGC CCAACGACCC CCGCCCATTG ACGTCAATAA TGACGTATGT 
TCCCATAGTA ACGCCAATAG GGACTTTCCA TTGACGTCAA TGGGTGGAGT 
ATTTACGGTA AACTGCCCAC TTGGCAGTAC ATCAAGTGTA TCATATGCCA 
AGTACGCCCC CTATTGACGT CAATGACGGT AAATGGCCCG CCTGGCATTA 
TGCCCAGTAC ATGACCTTAT GGGACTTTCC TACTTGGCAG TACATCTACG 
TATTAGTCAT CGCTATTACC ATGGTGATGC GGTTTTGGCA GTACATCAAT

FIGURE 5C

TGTACTGGAG GCTGAAGTTC AGATGTGCGG CGAGTTGCGT GACTACCTAC 
GGGTAACAGT TTCTTTATGG CAGGGTGAAA CGCAGGTCGC CAGCGGCACC 
GCGCCTTTCG GCGGTGAAAT TATCGATGAG CGTGGTGGTT ATGCCGATCG 
CGTCACACTA CGTCTGAACG TCGAAAACCC GAAACTGTGG AGCGCCGAAA 
TCCCGAATCT CTATCGTGCG GTGGTTGAAC TGCACACCGC CGACGGCACG 
CTGATTGAAG CAGAAGCCTG CGATGTCGGT TTCCGCGAGG TGCGGATTGA
AAATGGTCTG CTGCTGCTGA ACGGCAAGCC GTTGCTGATT CGAGGCGTTA
ACCGTCACGA GCATCATCCT CTGCATGGTC AGGTCATGGA TGAGCAGACC
ATGGTGCAGG ATATCCTGCT GATGAAGCAG AACAACTTTA ACGCCGTGCG
CTGTTCGCAT TATCCGAACC ATCCGCTGTG GTACACGCTG TGCGACCGCT
ACGGCCTGTA TGTGGTGGAT GAAGCCAATA TTGAAACCCA CGGCATGGTG
CCAATGAATC GTCTGACCGA TGATCCGCGC TGGCTACCGG CGATGAGCGA
ACGCGTAACG CGAATGGTGC AGCGCGATCG TAATCACCCG AGTGTGATCA
TCTGCTCGCT GGGGAATGAA TCAGGCCACG GCGCTAATCA CGACGCGCTG
TATCGCTGGA TCAAATCTGT CGATCCTTCC CGCCCGGTGC AGTATGAAGG
CGGCGGAGCC GACACCACGG CCACCGATAT TATTTGCCCG ATGTACGCGC
GCGTGGATGA AGACCAGCCC TTCCCGGCTG TGCCGAAATG GTCCATCAAA
AAATGGCTTT CGCTACCTGG AGAGACGCGC CCGCTGATCC TTTGCGAATA
CGCCCACGCG ATGGGTAACA GTCTTGGCGG TTTCGCTAAA TACTGGCAGG
CGTTTCGTCA GTATCCCCGT TTACAGGGCG GCTTCGTCTG GGACTGGGTG
GATCAGTCGC TGATTAAATA TGATGAAAAC GGCAACCCGT GGTCGGCTTA
CGGCGGTGAT TTTGGCGATA CGCCGAACGA TCGCCAGTTC TGTATGAACG
GTCTGGTCTT TGCCGACCGC ACGCCGCATC CAGCGCTGAC GGAAGCAAAA
CACCAGCAGC AGTTTTTCCA GTTCCGTTTA TCCGGGCAAA CCATCGAAGT 
GACCAGCGAA TACCTGTTCC GTCATAGCGA TAACGAGCTC CTGCACTGGA 
TGGTGGCGCT GGATGGTAAG CCGCTGGCAA GCGGTGAAGT GCCTCTGGAT

AGACATGATA AGATACATTG ATGAGTTTGG ACAAACCACA ACTAGAATGC
AGTGAAAAAA ATGCTTTATT TGTGAAATTT GTGATGCTAT TGCTTTATTT 
GTAACCATTA TAAGCTGCAA TAAACAAGTT AACAACAACA ATTGCATTCA 
TTTTATGTTT CAGGTTCAGG GGGAGGTGTG GGAGGTTTTT TCGGATCCTC 
TAGAGTCGAC GACGCGAGGC TGGATGGCCT TCCCCATTAT GATTCTTCTC 
GCTTCCGGCG GCATCGGGAT GCCCGCGTTG CAGGCCATGC TGTCCAGGCA 
GGTAGATGAC GACCATCAGG GACAGCTTCA AGGATCGCTC GCGGCTCTTA 
CCAGCCTAAC TTCGATCACT GGACCGCTGA TCGTCACGGC GATTTATGCC 
GCCTCGGCGA GCACATGGAA CGGGTTGGCA TGGATTGTAG GCGCCGCCCT 
ATACCTTGTC TGCCTCCCCG CGTTGCGTCG CGGTGCATGG AGCCGGGCCA 
CCTCGACCTG AATGGAAGCC GGCGGCACCT CGCTAACGGA TTCACCACTC 
CAAGAATTGG AGCCAATCAA TTCTTGCGGA GAACTGTGAA TGCGCAAACC 
AACCCTTGGC AGAACATATC CATCGCGTCC GCCATCTCCA GCAGCCGCAC 
GCGGCGCATC TCGGGCAGCG TTGGGTCCTG GCCACGGGTG CGCATGATCG 
TGCTCCTGTC GTTGAGGACC CGGCTAGGCT GGCGGGGTTG CCTTACTGGT 
TAGCAGAATG AATCACCGAT ACGCGAGCGA ACGTGAAGCG ACTGCTGCTG 
CAAAACGTCT GCGACCTGAG CAACAACATG AATGGTCTTC GGTTTCCGTG 
TTTCGTAAAG TCTGGAAACG CGGAAGTCAG CGCCCTGCAC CATTATGTTC 
CGGATCTGCA TCGCAGGATG CTGCTGGCTA CCCTGTGGAA CACCTACATC 
TGTATTAACG AAGCCTTTCT CAATGCTCAC GCTGTAGGTA TCTCAGTTCG 
GTGTAGGTCG TTCGCTCCAA GCTGGGCTGT GTGCACGAAC CCCCCGTTCA 
GCCCGACCGC TGCGCCTTAT CCGGTAACTA TCGTCTTGAG TCCAACCCGG 
TAAGACACGA CTTATCGCCA CTGGCAGCAG CCACTGGTAA CAGGATTAGC 
AGAGCGAGGT ATGTAGGCGG TGCTACAGAG TTCTTGAAGT GGTGGCCTAA 
CTACGGCTAC ACTAGAAGGA CAGTATTTGG TATCTGCGCT CTGCTGAAGC 
CAGTTACCTT CGGAAAAAGA GTTGGTAGCT CTTGATCCGG CAAACAAACC

FIGURE 7A

TCTTCCGCTT CCTCGCTCAC TGACTCGCTG CGCTCGGTCG TTCGGCTGCG 50 

GCGAGCGGTA TCAGCTCACT CAAAGGCGGT AATACGGTTA TCCACAGAAT 100 

CAGGGGATAA CGCAGGAAAG AACATGTGAG CAAAAGGCCA GCAAAAGGCC 150 

AGGAACCGTA AAAAGGCCGC GTTGCTGGCG TTTTTCCATA GGCTCCGCCC 200 

CCCTGACGAG CATCACAAAA ATCGACGCTC AAGTCAGAGG TGGCGAAACC 250 

CGACAGGACT ATAAAGATAC CAGGCGTTTC CCCCTGGAAG CTCCCTCGTG 300 

CGCTCTCCTG TTCCGACCCT GCCGCTTACC GGATACCTGT CCGCCTTTCT 350 

CCCTTCGGGA AGCGTGGCGC TTTCTCATAG CTCACGCTGT AGGTATCTCA 400 

GTTCGGTGTA GGTCGTTCGC TCCAAGCTGG GCTGTGTGCA CGAACCCCCC 450 

GTTCAGCCCG ACCGCTGCGC CTTATCCGGT AACTATCGTC TTGAGTCCAA 500 

CCCGGTAAGA CACGACTTAT CGCCACTGGC AGCAGCCACT GGTAACAGGA 550 

TTAGCAGAGC GAGGTATGTA GGCGGTGCTA CAGAGTTCTT GAAGTGGTGG 600 

CCTAACTACG GCTACACTAG AAGAACAGTA TTTGGTATCT GCGCTCTGCT 650 

GAAGCCAGTT ACCTTCGGAA AAAGAGTTGG TAGCTCTTGA TCCGGCAAAC 700 

AAACCACCGC TGGTAGCGGT GGTTTTTTTG TTTGCAAGCA GCAGATTACG 750 

CGCAGAAAAA AAGGATCTCA AGAAGATCCT TTGATCTTTT CTACGGGGTC 800 

TGACGCTCAG TGGAACGAAA ACTCACGTTA AGGGATTTTG GTCATGAGAT 850 

TATCAAAAAG GATCTTCACC TAGATCCTTT TAAATTAAAA ATGAAGTTTT 900 

AAATCAATCT AAAGTATATA TGAGTAAACT TGGTCTGACA GTTACCAATG 950 

CTTAATCAGT GAGGCACCTA TCTCAGCGAT CTGTCTATTT CGTTCATCCA 1000 

TAGTTGCCTG ACTCCCCGTC GTGTAGATAA CTACGATACG GGAGGGCTTA 1050 

CCATCTGGCC CCAGTGCTGC AATGATACCG CCAGACCCAC GCTCACCGGC 1100 

TCCAGATTTA TCAGCAATAA ACCAGCCAGC CGGAAGGGCC GAGCGCAGAA 1150 

GTGGTCCTGC AACTTTATCC GCCTCCATCC AGTCTATTAA TTGTTGCCGG 1200 

GAAGCTAGAG TAAGTAGTTC GCCAGTTAAT AGTTTGCGCA ACGTTGTTGC 1250 

CATTGCTACA GGCATCGTGG TGTCACGCTC GTCGTTTGGT ATGGCTTCAT 1300 






PCT/US95/14017 

WO 96/13597 

FIGURE 7B 



TCAGCTCCGC TTCCCAACGA TCAAGGCGAG TTACATGATC CCCCATGTTG 
TGCAAAAAAG CGGTTAGCTC CTTCGGTCCT CCGATCGTTG TCAGAAGTAA 
GTTGGCCGCA GTGTTATCAC TCATGGTTAT GGCAGCACTG CATAATTCTC 
TTACTGTCAT GCCATCCGTA AGATGCTTTT CTGTGACTGG TGAGTACTCA 
ACCAAGTCAT TCTGAGAATA GTGTATGCGG CGACCGAGTT GCTCTTGCCC 
GGCGTCAATA CGGGATAATA CCGCGCCACA TAGCAGAACT TTAAAAGTGC 
TCATCATTGG AAAACGTTCT TCGGGGCGAA AACTCTCAAG GATCTTACCG 
CTGTTGAGAT CCAGTTCGAT GTAACCCACT CGTGCACCCA ACTGATCTTC 
AGCATCTTTT ACTTTCACCA GCGTTTCTGG GTGAGCAAAA ACAGGAAGGC 
AAAATGCCGC AAAAAAGGGA ATAAGGGCGA CACGGAAATG TTGAATACTC 
ATACTCTTCC TTTTTCAATA TTATTGAAGC ATTTATCAGG GTTATTGTCT 
CATGAGCGGA TACATATTTG AATGTATTTA GAAAAATAAA CAAATAGGGG 
TTCCGCGCAC ATTTCCCCGA AAAGTGCCAC CTGACGTCTA AGAAACCATT 
ATTATCATGA CATTAACCTA TAAAAATAGG CGTATCACGA GGCCCTTTCG 
TCTCGCGCGT TTCGGTGATG ACGGTGAAAA CCTCTGACAC ATGCAGCTCC 
CGGAGACGGT CACAGCTTGT CTGTAAGCGG ATGCCGGGAG CAGACAAGCC 
CGTCAGGGCG CGTCAGCGGG TGTTGGCGGG TGTCGGGGCT GGCTTAACTA 
TGCGGCATCA GAGCAGATTG TACTGAGAGT GCACCATAAA ATTGTAAACG 
TTAATATTTT GTTAAAATTC GCGTTAAATT TTTGTTAAAT CAGCTCATTT 
TTTAACCAAT AGGCCGAAAT CGGCAAAATC CCTTATAAAT CAAAAGAATA 
GCCCGAGATA GGGTTGAGTG TTGTTCCAGT TTGGAACAAG AGTCCACTAT 
TAAAGAACGT GGACTCCAAC GTCAAAGGGC GAAAAACCGT CTATCAGGGC 
GATGGCCCAC TACGTGAACC ATCACCCAAA TCAAGTTTTT TGGGGTCGAG 
GTGCCGTAAA GCACTAAATC GGAACCCTAA AGGGAGCCCC CGATTTAGAG 
CTTGACGGGG AAAGCCGGCG AACGTGGCGA GAAAGGAAGG GAAGAAAGCG 
AAAGGAGCGG GCGCTAGGGC GCTGGCAAGT GTAGCGGTCA CGCTGCGCGT

FIGURE 7D

AACCTACAGG TGGGGTCTTT CATTCCCCCC TTTTTCTGGA GACTAAATAA 
AATCTTTTAT TTTATCTATG GCTCGTACTC TATAGGCTTC AGCTGGTGAT 
ATTGTTGAGT CAAAACTAGA GCCTGGACCA CTGATATCCT GTCTTTAACA 
AATTGGACTA ATCGCGGGAT CAGCCAATTC CATGAGCAAA TGTCCCATGT 
CAACATTTAT GCTGCTCTCT AAAGCCTTGT ATCTTGCATC TCTTCTTCTG
TCTCCTCTTT CAGAGCAGCA ATCTGGGGCT TAGACTTGCA CTTGCTTGAG
TTCCGGTGGG GAAAGAGCTT CACCCTGTCG GAGGGGCTGA TGGCTTGCCG
GAAGAGGCTC CTCTCGTTCA GCAGTTTCTG GATGGAATCG TACTGCCGCA
CTTTGTTCTC TTCTATGACC AAAAATTGTT GGCATTCCAG CATTGCTTCT
ATCCTGTGTT CACAGAGAAT TACTGTGCAA TCAGCAAATG CTTGTTTTAG
AGTTCTTCTA ATTATTTGGT ATGTTACTGG ATCCAAATGA GCACTGGGTT
CATCAAGCAG CAAGATCTTC GCCTTACTGA GAACAGATCT AGCCAAGCAC
ATCAACTGCT TGTGGCCATG GCTTAGGACA CAGCCCCCAT CCACAAGGAC
AAAGTCAAGC TTCCCAGGAA ACTGTTCTAT CACAGATCTG AGCCCAACCT
CATCTGCAAC TTTCCATATT TCTTGATCAC TCCACTGTTC ATAGGGATCC
AAGTTTTTTC TAAATGTTCC AGAAAAAATA AATACTTTCT GTGGTATCAC
TCCAAAGGCT TTCCTCCACT GTTGCAAAGT TATTGAATCC CAAGACACAC
CATCGATCTG GATTTCTCCT TCAGTGTTCA GTAGTCTCAA AAAAGCTGAT
AACAAAGTAC TCTTCCCTGA TCCAGTTCTT CCCAAGAGGC CCACCCTCTG
GCCAGGACTT ATTGAGAAGG AAATGTTCTC TAATATGGCA TTTCCACCTT
CTGTGTATTT TGCTGTGAGA TCTTTGACAG TCATTTGGCC CCCTGAGGGC
CAGATGTCAT CTTTCTTCAC GTGTGAATTC TCAATAATCA TAACTTTCGA
GAGTTGGCCA TTCTTGTATG GTTTGGTTGA CTTGGTAGGT TTACCTTCTG
TTGGCATGTC AATGAACTTA AAGACTCGGC TCACAGATCG CATCAAGCTA
TCCACATCTA TGCTGGAGTT TACAGCCCAC TGCAATGTAC TCATGATATT
CATGGCTAAA GTCAGGATAA TACCAACTCT TCCTTCTCCT TCTCCTGTTG

CAGAAGGTCA TCCAAAATTG
CACCTGCTTT CAACGTGTTG 
TGTAACATTT TGTGGTGTAA 
CAGTGGTAGA CCTCTGAAGA 
CCACGTAAAT GTAAAACACA 
GCATAGCTGT TATTTCTACT 
AGTGTTTCCA AGGAGCCACA 
GAAAAATTAC TAAGCACCAA
GTAATATATC GAAGGTATGT 


CATATCATCA AAAAAGCACT 
TTATTTCCAA GCCAGTTTCT 
TCAGTCAAGT TTGCCTGAGG 
TGTCTTTCGG TGAATGTTCT 
TCAGGACAGA CTGCCTCCTT 
ACGCTGATGC GAGGCAGTAT 
GGACAGCCTT CTCTCTAAAG 
TTTGTAAGGG AGTCTTTTGC 
GGATTGAGAA TAGAATTCTT

FIGURE 7E

ACAGCAATGA AGAAGATGAC 
GCGCAGTGTT GACAGGTACA 
CTTTGTGGAA CAGAGTTTCA 
GTCCATAGTC CTTTTAAGCT 
GCCTTCAGAT TCCAGTTGTT 
CTCTCAACAT AATAAAAGCC 
TAGGGTTGTA AAACTGCGAC 
CAACTGGATG AAGTCAAATA 
CTATATCTTT GGAGAATCTA 
AGGGTTGACA TAGGTGCTTG 
AATTTTCGAC ACTGTGATTA 
ATCCCATAGC AAGCAAAGTG 
TAATACGAAC TGGTGCTGGT 
ATGAGTACTA TTCCCTTTGT 
GCACAACCAA AGAAGCAGCC 
ATTAGCACAA AAATTAAGCT 
GTTCCATGTA GTCACTGCTG 
CCTTTAAGTC TTCTTCGTTA 
TGAGATAACC TTCTTGAATA 
GGCCAGTGAC ACTTTTCGTG 
GACCTTGGTT AACTGAGTGT 
CGTGCCTGAA GCGTGGGGCC 
CGCCTCTCCC TGCTCAGAAT 
GCTCATCAGA ATCCTCTTCG 
ACAATGGAAA ATTTTCGTAT 
CCTTTTTTCC CCAAACTCTC

FIGURE 7F

AAAAGATTGT TTTTTTGTTT CTGTCCAGGA GACAGGAGCA TCTCCTTCTA 
ATGAGAAACG GTGTAAGGTC TCAGTTAGGA TTGAATTTCT TCTTTCTGCA 
CTAAATTGGT CGAAAGAATC ACATCCCATG AGTTTTGAGC TAAAGTCTGG 
CTGTAGATTT TGGAGTTCTG AAAATGTCCC ATAAAAATAG CTGCTACCTT 
CATGCAAAAT TAATATTTTG TCAGCTTTCT TTAAATGTTC CATTTTAGAA 
GTGACCAAAA TCCTAGTTTT GTTAGCCATC AGTTTACAGA CACAGCTTTC
AAATATTTCT TTTTCTGTTA AAACATCTAG GTATCCAAAA GGAGAGTCTA
ATAAATACAA ATCAGCATCT TTGTATACTG CTCTTGCTAA AGAAATTCTT
GCTCGTTGAC CTCCACTCAG TGTGATTCCA CCTTCTCCAA GAACTATATT
GTCTTTCTCT GCAAACTTGG AGATGTCCTC TTCTAGTTGG CATGCTTTGA
TGACGCTTCT GTATCTATAT TCATCATAGG AAACACCAAA GATGATATTT
TCTTTAATGG TGCCAGGCAT AATCCAGGAA AACTGAGAAC AGAATGAAAT
TCTTCCACTG TGCTTAATTT TACCCTCTGA AGGCTCCAGT TCTCCCATAA
TCATCATTAG AAGTGAAGTC TTGCCTGCTC CAGTGGATCC AGCAACCGCC
AACAACTGTC CTCTTTCTAT CTTGAAATTA ATATCTTTCA GGACAGGAGT
ACCAAGAAGT GAGAAATTAC TGAAGAAGAG GCTGTCATCA CCATTAGAAG
TTTTTCTATT GTTATTGTTT TGTTTTGCTT TCTCAAATAA TTCCCCAAAT
CCCTCCTCCC AGAAGGCTGT TACATTCTCC ATCACTACTT CTGTAGTCGT
TAAGTTATAT TCCAATGTCT TATATTCTTG CTTTTGTAAG AAATCCTGTA
TTTTGTTTAT TGCTCCAAGA GAGTCATACC ATGTTTGTAC AGCCCAGGGA
AATTGCCGAG TGACCGCCAT GCGCAGAACA ATGCAGAATG AGATGGTGGT
GAATATTTTC CGGAGGATGA TTCCTTTGAT TAGTGCATAG GGAAGCACAG
ATAAAAACAC CACAAAGAAC CCTGAGAAGA AGAAGGCTGA GCTATTGAAG
TATCTCACAT AGGCTGCCTT CCGAGTCAGT TTCAGTTCTG TTTGTCTTAA
GTTTTCAATC ATTTTTTCCA TTGCTTCTTC CCAGCAGTAT GCCTTAACAG
ATTGGATGTT CTCGATCATT TCTGAGGTAA TCACAAGTCT TTCACTGATC

FIGURE 7H

TTACCGTAAA TACTCCACCC ATTGACGTCA ATGGAAAGTC CCTATTGGCG 9150 

TTACTATGGG AACATACGTC ATTATTGACG TCAATGGGCG GGGGTCGTTG 9200 

GGCGGTCAGC CAGGCGGGCC ATTTACCGTA AGTTATGTAA CGACCTGCAG 9250 

GCTGATCTCC CTAGACAAAT ATTACGCGCT ATGAGTAACA CAAAATTATT 9300 

CAGATTTCAC TTCCTCTTAT TCAGTTTTCC CGCGAAAATG GCCAAATCTT 9350 

ACTCGGTTAC GCCCAAATTT ACTACAACAT CCGCCTAAAA CCGCGCGAAA 9400 

ATTGTCACTT CCTGTGTACA CCGGCGCACA CCAAAAACGT CACTTTTGCC 9450 

ACATCCGTCG CTTACATGTG TTCCGCCACA CTTGCAACAT CACACTTCCG 9500 

CCACACTACT ACGTCACCCG CCCCGTTCCC ACGCCCCGCG CCACGTCACA 9550 

AACTCCACCC CCTCATTATC ATATTGGCTT CAATCCAAAA TAAGGTATAT 9600 

TATTGATGAT GCTAGCATGC GCAAATTTAA AGCGCTGATA TCGATCGCGC 9650 

GCAGATCTGT CATGATGATC ATTGCAATTG GATCCATATA TAGGGCCCGG 9700 

GTTATAATTA CCTCAGGTCG ACGTCCCATG GCCATTCGAA TTCGTAATCA 9750 

TGGTCATAGC TGTTTCCTGT GTGAAATTGT TATCCGCTCA CAATTCCACA 9800 

CAACATACGA GCCGGAAGCA TAAAGTGTAA AGCCTGGGGT GCCTAATGAG 9850 

TGAGCTAACT CACATTAATT GCGTTGCGCT CACTGCCCGC TTTCCAGTCG 9900 

GGAAACCTGT CGTGCCAGCT GCATTAATGA ATCGGCCAAC GCGCGGGGAG 9950 

AGGCGGTTTG CGTATTGGGC GC 9972

FIGURE 12A

CCAATTCCAT CATCAATAAT ATACCTTATT TTGGATTGAA GCCAATATGA 
TAATGAGGGG GTGGAGTTTG TGACGTGGCG CGGGGCGTGG GAACGGGGCG 
GGTGACGTAG GTTTTAGGGC GGAGTAACTT GTATGTGTTG GGAATTGTAG 
TTTTCTTAAA ATGGGAAGTT ACGTAACGTG GGAAAACGGA AGTGACGATT 
TGAGGAAGTT GTGGGTTTTT TGGCTTTCGT TTCTGGGCGT AGGTTCGCGT 
GCGGTTTTCT GGGTGTTTTT TGTGGACTTT AACCGTTACG TCATTTTTTA 
GTCCTATATA TACTCGCTCT GCACTTGGCC CTTTTTTACA CTGTGACTGA 
TTGAGCTGGT GCCGTGTCGA GTGGTGTTTT TTTAATAGGT TTTCTTTTTT 
ACTGGTAAGG CTGACTGTTA GGCTGCCGCT GTGAAGCGCT GTATGTTGTT 
CTGGAGCGGG AGGGTGCTAT TTTGCCTAGG CAGGAGGGTT TTTCAGGTGT 
TTATGTGTTT TTCTCTCCTA TTAATTTTGT TATACCTCCT ATGGGGGCTG 
TAATGTTGTC TCTACGCCTG CGGGTATGTA TTCCCCCCAA GCTTGCATGC 
CTGCAGGTCG ACTCTAGAGG ATCCGAAAAA ACCTCCCACA CCTCCCCCTG 
AACCTGAAAC ATAAAATGAA TGCAATTGTT GTTGTTAACT TGTTTATTGC 
AGCTTATAAT GGTTACAAAT AAAGCAATAG CATCACAAAT TTCACAAATA 
AAGCATTTTT TTCACTGCAT TCTAGTTGTG GTTTGTCCAA ACTCATCAAT 
GTATCTTATC ATGTCTGGAT CCCCGCGGCC GCTCTAGAAC TAGTGGATCC 
CCCGGGCTGC AGGAATTCCG TAACATAACT GCGTGCTTTA TTGAGATACA 
CAGTAAAGCA GTAATATAAT ACAATAGTAA GGCATATATT TGGTGAAATC 
TGATATGTTG TGAAAATGCA GTAAAACTGA AGTTTAAAAA AATAATTAGT 
AAATGTTACA GTGTTGGTGT TAAAACACAA TCTATTATGA TACTCAAGTA 
AGAGTCCAGT ACCTGGAGAC AATGATGATA CATGCCATGT GATGATTATG 
CTTCAGTTAC ACTGATTATG ATTTACACTT TAATACTTGA TGGTTATAAA 
GAACATGAAA TGATGTCCAA ATTATGCTTA AAATCAGCAA TAAAGCTCTC 
AGTTTTTATT CAAATATTTT GATAGATTCA CTCCAGAACT AATATCTAAA

CCATTCTTTT ACAGAGTTGT TGCTCCCCTC ATATAAATTG ACTGAGGAGC 2300
CGCAACCTTT AGCTCCTACC ATCTTCCTCC TACTGTCTGG GAGTTAAAAA 2350
TGTCATCTGA TGTTCTATTG CAGAAACATC ATTAAATATA ACCCAACAGT 2400
AGGAAGTTGA ATATATCAGC CAACAAATTA CTATGATAGT AAGTCCTGTG 2450
TATTCATTCG CATGTTCCTT GAAAAAAATG AATCCTCTAG CTCTCAGTGG 2500

FIGURE 12C

TAGACAATAT TTTAGTGTGG AAAGTTTAAA ACTAGAAACA TCTGGAGCCC
AAATTCACTC TTGCCCAAGC CGGTAGTCTC CTGGCTTTGG GCTCCAGGGA
CATAGGAAG TGGCAAGAAC AGATAAGCCC AGATGACTAG AAGCAATTTC
CTATATACCT ATAGTATTTA ATTTGAAGAA GTAACTTCAT ATCTATTTAT
GAAAGCCCAT AGCCAGCCCC TATACTTGTA GACATATAGA TGTATAAAAT
TGAAGCAGTC ATTTGGTGGG ACTCAGTCAA CAATTCTCAA AAGAGCAATA

GTTCGTATGC AAGAAAATAA AAAAACGTCA TGAATTCCAT ATGAATACCA 
CGCTAAAGTA ATGCAAAACA ATGTGCTGCC TCAGTGTGTG TGTGTGTGTG 
TGTGTGTGTG GTGGGTTCGT GCATGTATGT GTGCGTGTGT GTGTGTGTGT 
GTGTGTGTGT GTGTGTGTGC GTGTGTGTTT GTTTAGGGGT TTTTATAAAC 
AACTTTTTTT ATAAAGCACA CTTTAGTTTA CAATCTCTCT TTATAACTGT 
TATAAATTTT TAAACAACCC AAAATGCGTT CCATATAAAG AAATGGCAAG 
TTATTTAGCT ATCAAGATTT TACATGTTTT CTTTTAACTT TTTTGTACAA 
TTGCATAGAC GTGTAAAACC TGCCATTGTT AACAAAACAA TAACAGACTT
AGAAACTACT GAAATCTACA GTATAGTACC ACTACCCTTC ACAAAAATAT
AGATTTTATT TCTTGTAAAC TCTTACTGTC TAATCCTCTT TGTTGTACGA
ATATTATAAA AACCATGCGG GAATCAGGAG TTGTAAAACA TTTATTCTGC
TCCTTCTTCA TCTGTCATGA CTGAAACTAA GGACTCCATC GCTCTGCCCA
AATCATCTGC CATGTGGAAA AGGCTTCCTA CATTGTGTCC TCTCTCATTG
GCTTTCCGGG GGCATTTCTT CCTCTTGAAC TAGGGAAGGA GTTGTTGAGT
TGCTCCATCA CTTCTTCTAA CCCTGTGCTT GTGTCCTGGG GAGGACTCAG
AAGATCTTCC TCACCCATAG ATTCTGAAGT TTGACTGCCA ACCACTCGGA
GCAGCATAGG CTGACTGCTA TCTGACCTCT GCAGAGAGGT GGAAGGAGAG
GACACCGTGG TGCCATTCAC CTTAGCTTCA GCCTGGGGCT GCTCCAGGAG
CTGTCTCAGT CTATGTAACT GAGACTCCAG CTGTTTATTG TGGTCTTCCA

GGATTTGCAT CCTGGCTTCC AGGCGTCCTT TGTGTTGGCG CAGTAGCTTA 3800
GCCTCAGCAA TGAGCTCAGC ATCCCTGGGA CTCTGAGGAG AGGTGGGCAT 3850
CATCTCAGGA GGAGATGGCA GTGGAGACAG GCCTTTATGC TCATGCTGCT 3900
GCTTCAGGCG ATCATATTCT GCTTGCAGAT TCCTGTTTTC TTCCTCAAGA 3950
TCTGCTAGGA TTCTCTCTAG CTCCCCTCTT TCCTCACTCT CTAAGGAAAT 4000
CAAGATCTGG GCAGGACTAC GAGGCTGGCT CAGGGGGGAG TCCTGGTTCA 4050
AACTTTGGCA GTAATGCTGG ATTAACAAAT GTTCATCATC TATGCTCTCA 4100
TTAGGAGAGA TGCTATCATT TAGATAAGAT CCATTGCTGT TTTCCATTTC 4150
TGCTAGCCTG CTAGCATAAT GTTCAATGCG TGAATGAGTA TCATCGTGTG 4200
AAAGCTGGGG GGACGAGGCA GGCGCAGAAT CTACTGGCCA GAAGTTGATC 4250
AGAGTAACGG GAGTTTCCAT GTTGTCCCCC TCTAACACAG TCTGCACTGG 4300
CAGGTAGCCC ATTCGGGGAT GCTTCGCAAA ATACCTTTTG GTTCGAAATT 4350
TGTTTTTTAG TACCTTGGCG AAGTCGCGAA CATCTTCTCC GGATGTAGTC 4400
GGAGTGCAAT ACTCTACCAT GGGGTAGTGC ATTTTATGGC CCTTTGCAAC 4450
TCGGCCAGAA AAAAAGCAAC TTTGGCAGAT GTCATAATTA AAATGCTTTA 4500
GGCTTCTGTA CCTGAATCCA ATGATTGGAC ACTCCTTACA GATGTTACAC 4550
TTGGCTTGAT GCTTGGCAGT TTCAGCAGCA GCCACTCTGT GCAAGACGGG 4600
CAGCCACACC ATAGACTGGG GTTCCAGGCG CATCCAGTCA AGGAAGAGAG 4650
CAGCTTCAAT CTCAGGTTTA TTATTGGCAA ATTGGAAGCA GCTCCTGACA 4700
CTCGGCTCAA TGTTACTGCC CCCAAAGGAA GCAACTTCAC CCAACTGTCT 4750
TGGGATTTGA ATAGAATCAT GCAGAAGAAG ACCCAGCCTA CGCTGGTCAC 4800
AAAAGCCAGT TGAACTTGCC ACTTGCTTGA AAAGGTATCT GTACTTGTCT 4850
TCCAAGTGTG CTTTACACAG AGAAATGATG CCAGTTTTAA AAGACAGGAC 4900
ACGGATCCTC CCTGTTCGTC CCGTATCATA AACATTGAGA AGCCAGTTGA 4950
GACACATATC CACACAGAGA GGGACATTGA CCAGATTGTT GTGCTCTTGC 5000
TCCAGACGAT CATAAATTGT AGTCAAACAG TTAATTATCT GCAGGATATC 5050

FIGURE 12E

CATGGGCTGG TCATTTTGCT TGAGGTTGTG CTGGTCCAGG GCATCACATG 
CAGCTGACAG GCTCAAGAGA TCCAAGCAAA GGGCCTTCTG GAGCCTTCTG 
AGCTTCATGG CAGTCCTATA CGCGGAGAAC CTGACATTAT TCAGGTCAGC 
TAAAGACTGG TAGAGCTCTG TCATTTTGGG GTGGTCCCAA CAAGTGGTTT 
GGGTCTCGTG GTTGATATAG TAGGGCACTT TGTTTGGTGA GATGGCTCTC 
TCCCAGGGAC CCTGAACTGA AGTGGAAAGG AAGTGCTGGG ATGCAGGACC 
AAAGTCCCTG TGGGCTTCAT GCAGCTGTCT GACACGGTCC TCCACAGCCA 
CCTGTAGAAG CCTCCATCTG GTATTCAGAT CTTCCAAAGT GCTGAGGTTA 
TAAGGTGAGA GCTGAATGCC CAGTGTGGTC AGCTGATGTG CAAGGTCATT 
GACACGATTG ACATTCTCTT TAAGAGGTGC AATTTCTCCC CGAAGTGCCT 
TGACTTTTTC AAGGTGATCT TGCAGAGAGT CAATGAGGAG ATCCCCCACT 
GGCTGCCAGG ATCCCTTGAT CACCTCAGCT TGGCGCAACT TGAGGTCCAG 
TTCATCGGCA GCTTCCTGAA GTTCCTGGAG TCTTTCAAGA GCTTCATCTA 
TTTTTCTCTG CCAATCAGCT GAGCGCAGGT TCAATTTGTC CCATTCAGCG 
TTGACCTCTT CAGCCTGCTT TCGTAGGAGC CGAGTGACAT TCTGAGCTCT 
TTCTTCAGGA GGCAGTTCTC TGGGCTCCTG GTAGAGTTTC TCTAGTCCTT 
CCAAAGGCTG CTCTGTCAGA AATATTCTCA CAGTCTCCAG AGTACTCATG 
ATTACAGGTT CTTTAGTTTT CAATTCCCTC TTGAAGGCCC TATGTATATC 
ATTCTGCTTC TGAACTGCTG GGAAATCACC ACCGATGGGT GCCTGACGGC 
TCAGTTCATC ATCTTTCAGC TGTAGCCAAA CAAGAAGTTC CTGAAGAGAA 
AGATGCAAAC GCTTCCACTG GTCAGAACTT GCTTCCAAAT GGGACCTAAT 
GTTGAGAGAC TTTTTCTGAA GTTCACTCCA CTTGAAATTC ATGTTATCCA 
AACGTCTTTG TAACAGGGGT GCTTCATCCG AACCTTCCAG GGATCTCAGG 
ATTTTTTGGC CATTTTCATC AAGATTGTGA TAGATATCTG TGTGAGTTTC 
AATTTCTCCT TGGAGATCTT GCCATGGTTT CATCAGCTCT CTGACTCCCC 
TGGAGTCTTC TAGGAGCTTC TCCTTACGGG AAGCGTCCTG TAGGACATTG 
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GCAGTTGTTT CTGCTTCCGT AATCCAGGAA AGAAACTTCT CCAGGTCCAG 6400
AGGGAACTGC TGCAGTAATC TATGAGTTTC TTCCAAAGCA GCCTCTTGCT 6450
CACTTACTCT TTTATGAATG TTTCCCCAAG AAGTATTGAT ATTCTCTGTT 6500
ATCATGTGTA CTTTTCTGGT ATCATCAGCA GAATAGTCCC GAAGAAGTTT 6550
CAGTGCCAAA TCATTTGCCA CGTCTACACT TATCTGCCGT TGACGGAGGT 6600
CTTTGGCCAA CTGCTTGGTT TCTGTGATCT TCTTTTGGAT TGCATCTACT 6650
GTGTGAGGAC CTTCTTTCCA TGAGTCAAGC TTGCCTCTGA CCTGTCCTAT 6700
GACCTGTTCG GCTTCTTCCT TAGCTTCCAG CCATTGTGTT GAATCCTTTA 6750
ACATTTCATT CAACTGTTGT CTCCTGTTCT GCAGCTGTTC TTGAACCTCA 6800
TCCCACTGAA TCTGAATTCT TTCAATTCGA TCAGTAATGA TTGTTCTAGC 6850
TTCTTGATTG CTGGTTTTGT TTTTCAAATT CTGGGCAGCA GTAATGAGTT 6900
CTTCCAATTG GGGGCGTCTC TGTTCCAAAT CTTGCAGTGT TGCCTTCTGT 6950
TTGATGATCA TTTCATTGAT GTCTTCCAGA TCACCCACCA TCACTCTCTG 7000
TGATTTTATA ACTCGATCAA GCAGAGACAG CCAGTCTGTA AGTTCTGTCC 7050
AAGCTCGGTT GAAGTCTGCC AGTGCAGGTA CCTCCAACAG CAAAGAAGAT 7100
GGCATTTCTA GTTTGGAGAT GACAGTTTCC TTAGTAACCA CAGATTGTGT 7150
CACTAGAGTA ACAGTCTGAC TGGCAGAGGC TCCAGTAGTG CTCAGTCCAG 7200
GGGCACGGTC AGGCTGCTTT GTCCTCAGCT CCCGAAGTAA ATGGTTTACA 7250
GCCTCCCACT CAGACCTCAG ATCTTCTAAC TTCCTCTTCA CTGGCTGAGT 7300
GCTTGGTTTT TCCTTATACA AATGCTGCCC TTTCGACAAA AGCCTTTCCA 7350
CATCCGCTTG TTTACCGTGA ACTGTTACTT CAATCTCCTT TATGTCAAAC 7400
GGTCCTGCCT GACTTGGTTG GTTATAAATT TCCAACTGGT TTCTAATAGG 7450
AGAGACCCAC AGAAGCAGGT GATCCAGCTG CTCTTCAAGC TGCCTAAAAT 7500
CTTTTAAGTG AACCTCAAGC TCTCCTTGTT TCTCAGGTAA AGCTCTGGAG 7550
ACCTTTATCC ACTGGAGATT TGTCTGTTTG AGCTTCTTTT CAAGTTTATC 7600
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FIGURE 12G

TTGCTCTTCT GGCCTTATGG GAGCACTTAC AAGTACTGCT CCTCCTGTTT 
CATTTAATTG TTTTAGAATT CCCTGGCGCA GGGGCAACTC TTCTGCCAGT 
AACTTGACTT GTTCAAGTTG TTCTTTTAGC TGCTGCTCAT CTCCAAGTGG 
AGTAATAGCA ATGTTATCTG CTTCTTCCAG CCACAAAACA AATTCATTTA 
AATCTCTTTG AAATTCTGAC AAGACATTCT TTTGTTCTTC AATCCTCTTT 
CTCCTTTCTG CCAGCTCTTT GCAGATGTCG TGCCACCGCA GACTCAAGCT 
TCCTAATTTT TCTTGTAGAA TATTGACATC TGTTTTTGAA GACTGTTGAA 
TTATTTCTTC CCCAGTTGCA TTCAGTGTTC TGACAACAGC TTGACGCTGC 
CCAATGCCAT CCTGGAGTTC CTTAAGATAC CATTTGTATT TAGCATGTTC 
CCAGTTTTCA GGATTTTGTG TCTTTTTGAA AAACTGTTCA ACTTCATTCA 
GCCATTGATT AAATACCTTC ATATCATAAT GAAAGTGTCG CCATTTTTCA 
ACTGATCTGT CGAATCGCCC TTGTCGTTCC TTGTACATTC TATGAAGTTT 
TTCCCCCTGG AAATCCATCT GTGCCACGGC TTCCTGTACT TTCACCTTTT 
CCATGGAGGT GGCACTTTGC AAGGCTGCTG TCTTCTTCTT GTGAATAATA 
TCAATCCGAC CTGAGATTTG TTGCAAATTG TCTTTTATAT TCTTAAGAGA 
CTCCTCTTGC TTAAAAAGAT CTTCAAAATC TTTAGCACAG AGTTCAGGAG 
TATTTAGAAG ATGATCAACT TCTGAAAGAG CTTGTAAGAT ATGACTGATC 
TCGGTCAAAT AAGTAGAAGG CACATAAGAA ACATCCAAAG GCATATCTTC 
AGTCGTCACT ACCATAGTTT CTTCATGGAG AGTGTGAATT TGTGCAAAGT 
TGAGTCTTCG AAACTGAGCA AAATTGCTCT CAATTTGCCG CCAGCGCTTG 
CTGAGCTGGA TCTGAGTTGG CTCCACTGCC ATTGCGGCCC CATTCTCAGA 
CAAGCCCTCA GCTTGCCTGC GCACTGCATT CAGCTCCTCT TTCTTCTTCT 
GCAATTCACG ATCAATTTCC TTTAATTTTC TTTCATCTCT GGGTTCAGGT 
AGGCTGGCTA ATTTTTTTTC AATTTCATCC AAGCATTTCA GGAGATCATC 
AGCCTGCCTC TTGTACTGAT ACCACTGGTG AGAAATTTCT AGGGCCTTTT 
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CATTATGTTT TGTCTGTAAC 8900
TTTCTCTCAT CTGTGATTCT 8950
TTCATTTACA GTACCCTCAT 9000
CCTCTTTCAG ATTCACCCCC 9050
AATTTTTGTA TATCTGAGTT 9100
GGCCTTTCCA GTCTTAATTC 9150
GCTCAGAGAT TTGGGGCTCT 9200
GCCATCAATT TTGCTGCTTG 9250
GTCATTCATT TCAGCCTTTA 9300
GTTTCTTCTT TTCAGACTCA 9350
TTTGTGATTT GTTCTATGTT 9400
TTCCAACAAA AGATTTAGCC 9450
AGTTACTGTT CAGAAGACTC 9500
CCCAACACCA TTTTCAAAGA 9550
AGCCTTCTGT TTCTCAATCT 9600
CAGAATCCAA ATTACTTGGC 9650


TCTGTATCTG 
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TTGCTGCCAG 
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TCATCAAGCT 


CTTTGGGATT 
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TCCGGTTTTA 


ATCACCATTT 


9900 


TATACAAGTT CACACAATGA 9950
ACACTCTTGG TTTCCAATGC 10000
CATCTTACTT TCCTCTAGAC 10050
ATCGAAATTT CATGGAGACA 10100
TGTGAAAGAA CCCTTTGGTT 10150



SUBSTITUTE SHEET (RULE 26)



PCT/US95/14017 



WO 96/13597 



38/45 
FIGURE 12I

GGCATCCTTC CCCTGGTTAT GTTTCTTCAT TTCTTCTAAA CTTATCTCAT 
GACTTGTCAA ATCTGATTGG ATTTTCTGGG CTTCCTGAGG CATTTGAGCT 
GCATCCACCT TGTCAGTGAT ATAAGCTGCC AACTGCTTGT CAATGAATTC 
AAGCGACTCC TGAATTAAGT GCAAGGACTT TTCAATTTCC TGGGCAGACT 
GGATACTCTG TTCAAGCAAC TTTTGTTTCC TCACAGCCTC TTCATGTAGT 
TCCCTCCAAC GAGAATTAAA CGTCTCAAGC TCCTCATTGA TCAGTTCATC 
CATGACTCCT CCATCTGTAA GAGTCTGTGC CAATAGACGA ATCTGATTTG 
GGTTCTCCTC TGAATGATGC ATCAGATTTT CAAGAGATTC TAGCACTTCA 
GTGATTTCCT CAGGTCCTGC AGGAACATTT TCCATGGTTT TAAGTTTCAA 
TTCTACTTCA TTGAGCCACT TGTTTGCTTT CTCTAAATAT GACAATAACT 
CATGCCAACA TGCCCAAACT TCTTCCAAAG TTTTGCATTT TCCATTCAGC 
CTGGTGCACA GCCATTGGTA GTTGGTGGTC AGAGTTTCAA GTTCCTTTTT 
TAAGGCCTCT TGTGCTGAGG GTGGAGCGTG AGCTATTACA CTATTTACAG 
TCTCAGTAAG GAGTTTCACT TTAGTTTCTT TTTGTAGTGC CTCTTCTTTA 
GCTCTCTTCA TTTCTTCAAC AGCAGTCTGT AATTCATCTG GAGTTTTATA 
TTCAAAATCT CTCTCTAGAT ATTCTTCTTC AGCTTGTGTC ATCCACTCAT 
GCATCTCTGA TAGATCTTTT TGGAGGCTTA CGGTTTTATC CAAACCTGCC 
TTTAAGGCTT CCTTTCTGGT GTAGACCTGG CGGCATATGT GATCCCACTG 
AGTGTTAAGC TCTCTAAGTT CTGTCTCCAG TCTGGATGCA AACTCAAGTT 
CAGCTTCACT CTTTATCTTC TGCCCACCTT CATTAACACT ATTTAAACTG 
GGCTGAATTG TTTGAATATC ACCAACTAAA AGTCTGCATT GTTTGAGCTG 

TCTTTCAGGA 

GTGATTCTGA 



TTTTTTCAGG ATTTCAGCAT CCCCCAGGGC AGGCCATTCC 
AAACATCAAC TTCAGCCATC CATTTCTGTA AGGTTTTTAT 
AATTTTCGAA GTTTATTCAT ATGTTCTTCT AGCTTTTGGC AGCTTTCCAC 
CAACTGGGAG GAAAGTTTCT TCCAGTGCCC CTCAATCTCT TCAAATTCTG 
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ACAGATATTT CTGGCATATT TCTGAAGGTG CTTTCTTGGC CATCTCCTTC 11450
ACAGTGTCAC TCAGATAGTT GAAGCCATTT TGTTGCTCTT TCAAAGAACT 11500
TTGCAGAGCC TGTAATTTCC CGAGTCTCTC CTCCATTATT TCATATTCAG 11550
TAACACTAAG ATAAGGTACA GAGAGTTTGC TTTCTGACTG CTGGATCCAC 11600
GTCCTGATGC TACTCATTGT CTCCTGATAG CGCATTGGTG GTAAAGTGTC 11650
AAAAATTGTC TGTAGCTCTT TCTCTTTGGC CCTCACACCA TCAAAGATGT 11700
GGTTAAAATG ATTAGTAAAG GCCACAAAGT CTGCATCCAG AAACATTGGC 11750
CCCTGTCCCT WTCTTTCAG TTGTAGACTC TGAATTTTTA ATTGCTCAAT 11800
TTGAGGCTGA AGAGCTGACA ATCTGTTGAC TTCATCCTTA CAAATTTTTA 11850
ACTGGCTTTT AATTGCTGTT GGCTCTGATA GGGTGGTAGA CTGGGTTTTC 11900
AACAAGTTTT CGGCAGTAGT TGTCATCTGT TCCAATTGTT GTAGCTGATT 11950
ATAAAAGGTA ATGATGTTGG TTTGATACTC TAGCCAGTTA ACTCTCTCAC 12000
TCAGCAATTG GCAGAATTCT GTCCACCGGC TGTTCAGTTG TTCTGAAGCT 12050
TGTCTGATAC TTTCAGCATT AACACCCTCA TTTGCCATCT GTTCCACCAG 12100
GGCCTGAGCT GATCTGCTGG CATCTTGCAG TTTTCTGAAC TTCTCTGCTT 12150
TTTCTCGTGC TATGGCATTG ACTTTTTCTT GCAAGTCTGA GATGTTGCCT 12200
TCTTTTCGAT AGACTGCAAA TTCAGAACTC TGTAATACAG CTTCTGAACG 12250
AGTAATCCAA CTGTGAAGTT CAGTTATATC GACATCCAAC CTTTTCCTGA 12300
GTTCAGAATC CACAGTTATC TGCCTCTTCT TTTGAGGAGG TGGTGGTGGA 12350
AGTTCCTCTT GGGCATGTTT TACCATGATT TGTTCCCTTG TGGTCACCAT 12400
AGTTACCGTT TCCATTACAG TTGTCTGTGT TAGGGATGGT TGAGTGGTGG 12450
TGACAGCCTG TGAAATTTGT GCTGAACTCT TTTCAAGTTT TTGGGTTAAA 12500
TTGTCCCAAC GTTGTGCAAA GTTTTCCATC CAGATTTCCA TCTTTTGAGT 12550
CACTGACTTA TTTTTCAGTG CCGAAAGTAG ATCTTGATTG AGTGAACTTA 12600
GTTTTTCCAT GGTTGGCTTT TTCTTTTCTA GATCTATTTT TAAAGTAGAT 12650



FIGURE 12K 

ATTTTGTGAA GACTTGACAT CATTTCATTT TGATCTTTAA AGCCACTTGT 
CTGAATGTTC TTCATTGCAT CTTCTTTTTC TGAAAGCCAT GTACTAAAAA 
GGCACTGTTC TTCAGTAAAA TGCTGCCATT TTAGAAGAAT ATCTTGTAAA 
ACAATCCAGC GGTCTTCAGT CCATCTGCAG ATATTTGCCC ATCGATCTCC 
CAGTACCTTA AGTTGTTCTT CCAAAGCAGC TGTTGCATGA TCACCGCTGG 
ATTCATCAAC CACTACTACC ATGTGAGTGA GCGAGTTGAC CCTGACCTGC 
TCCTGTTCTA GATCTTCTTG AAGCACCTTA TGTTGTTGTA CTTGGCATTT 
TAGATCTTCA AGATCAGGTC CAAAGGGCTC TTCCTCCATT TTCTTAGTTC 
TCTCTTCAGT TTTTGTTAAC CAGTCATCTA GTTCTTTTAA TTTCTGATTC 
TGGAGATCCA TTAGAACTTT GTGTAATTTG CTTTGTTTTT CCATGCTAGC 
TACCCTGAGA CATTCCCATC TTGAATTTAG GAGATTCATT TGTTCTTGCA 
CTTCAGCTTC TTCATCTTCT GATAATTTCC CTTTTCCAAC TAGTTGACTT 
CCTAACTGTA GAACATTACC AACAAGTCCT TGATGAGATG TCAGATCCAT 
CATGAATCCC TCATGAGCAT GAAACTGTTC TTTCACTTCT TCAACATCAT 
TTGAAATCTC TCCTTGTGCT CGCAATGTAT CCTCGGCAGA AAGAAGCCAT 
GAAAGTACTT CTTCTAAAGC AGTTTGGTAA CTATCCAGAT TTACTTCCGT 
CTCCATCAAT GAACTGTCAA GTGACTTGTC TCTGGGAGCT TCCAAATGCT 
GTGAAGGATA GGGGCTCTGT GTGGAATCAG AGGTGGCAAC ATAAGCAGCC 
TGTGTGAAGG CATAACTCTT GAATCGAGGC TTAGGAGATG AAGAAGTTTG 
TTCATAGCCC TGTGCTAGAC TGACTGTGAT CTGTTGAGAG TAATGCATCT 
GGTGATGTAA TTGAAAATGT TCTTCTCTAG TTACTTTTGA AGATGTCCTG 
GGCAACATTT CCACTTCTTG AATGGCTTCA ATGCTCACTT GTTGTGGCAA 
AACTTGAAAG AGTGATGTGA TGTACATTAA GATGGACTTC TTGTCTGGAT 
AAGTGGTAGC AACATCTTCA GGATCAAGAA GTTTTTCTAT GCCTAACTGG 
CATTTTGCAA TGTTGAAGGC ATGTTCCAGT CTTTGGGTGG CTGAGTGCTG 
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GGGCCTGTGA CTATGGATAA 13950
TAGAGGTGAA GTTGATGACG 14000


CGAACCCAGC 


TCAGAAGAAT 
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CI TGATAGTT 


TTCATCACAT 


14100 
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FIGURE 12M 

ATGACTAATA CGTAGATGTA CTGCCAAGTA GGAAAGTCCC ATAAGGTCAT 
GTACTGGGCA TAATGCCAGG CGGGCCATTT ACCGTCATTG ACGTCAATAG 
GGGGCGTACT TGGCATATGA TACACTTGAT GTACTGCCAA GTGGGCAGTT 
TACCGTAAAT ACTCCACCCA TTGACGTCAA TGGAAAGTCC CTATTGGCGT 



TACTATGGGA 
GCGGTCAGCC 



ACATACGTCA TTATTGACGT CAATGGGCGG 
AGGCGGGCCA TTTACCGTAA GTTATGTAAC 
TCGACTCTAG AGGATCTCCC TAGACAAATA TTACGCGCTA 
AAAATTATTC AGATTTCACT 
CCAAATCTTA CTCGGTTACG 
CGCGCGAAAA 



ACTTTTGCCA 



TTGTCACTTC 
CATCCGTCGC 



TCCTCTTATT 
CCCAAATTTA 
CTGTGTACAC 
TTACATGTGT 



CAGTTTTCCC 
CTACAACATC 
CGGCGCACAC 
TCCGCCACAC 



ACACTTCCGC CACACTACTA CGTCACCCGC CCCGTTCCCA 
CACGTCACAA ACTCCACCCC CTCATTATCA 

ATTGATGATG CTAGCGGGGC 
CATGACAGAT CTGCGCGCGA 

TTGCGCATGC TAGCTATAGT TCTAGAGGTA 

TTAATAGGCC 



AAGGTATATT 
CAATGATCAT 



GCTACGTATA 
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GTACAGAGCT 
TTGGCACTGG 
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CCGTCGTTTT 
AATCGCCTTG 
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GTGGGCCGTG 
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GGGTCGTTGG 
GACCTGCAGG 
TGAGTAACAC 
GCGAAAATGG 
CGCCTAAAAC 
CAAAAACGTC 
TTGCAACATC 
CGCCCCGCGC 
AATCCAAAAT 
GATCCAATTG 
CGCTTTAAAT 
ACGTTAGCCG 
ATGGCGGCCG 
CCGCGGACAT 
CACCTTAAGC 
ACCCTGGCGT 
AGCTGGCGTA 
GCGCAGCCTG 



ATAGCGAAGA GGCCCGCACC GATCGCCCTT CCCAACAGTT 
AATGGCGAAT GGCGCCTGAT GCGGTATTTT CTCCTTACGC ATCTGTGCGG 
TATTTCACAC CGCATACGTC AAAGCAACCA TAGTACGCGC CCTGTAGCGG 
CGCATTAAGC GCGGCGGGTG TGGTGGTTAC GCGCAGCGTG ACCGCTACAC 
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FIGURE 12N 

TTGCCAGCGC CCTAGCGCCC GCTCCTTTCG CTTTCTTCCC TTCCTTTCTC 16500 

GCCACGTTCG CCGGCTTTCC CCGTCAAGCT CTAAATCGGG GGCTCCCTTT 16550 

AGGGTTCCGA TTTAGTGCTT TACGGCACCT CGACCCCAAA AAACTTGATT 16600 

TGGGTGATGG TTCACGTAGT GGGCCATCGC CCTGATAGAC GGTTTTTCGC 16650 

CCTTTGACGT TGGAGTCCAC GTTCTTTAAT AGTGGACTCT TGTTCCAAAC 16700 

TGGAACAACA CTCAACCCTA TCTCGGGCTA TTCTTTTGAT TTATAAGGGA 16750 

TTTTGCCGAT TTCGGCCTAT TGGTTAAAAA ATGAGCTGAT TTAACAAAAA 16800 

TTTAACGCGA ATTTTAACAA AATATTAACG TTTACAATTT TATGGTGCAC 16850 

TCTCAGTACA ATCTGCTCTG ATGCCGCATA GTTAAGCCAG CCCCGACACC 16900 

CGCCAACACC CGCTGACGCG CCCTGACGGG CTTGTCTGCT CCCGGCATCC 16950 

GCTTACAGAC AAGCTGTGAC CGTCTCCGGG AGCTGCATGT GTCAGAGGTT 17000 

TTCACCGTCA TCACCGAAAC GCGCGAGACG AAAGGGCCTC GTGATACGCC 17050 

TATTTTTATA GGTTAATGTC ATGATAATAA TGGTTTCTTA GACGTCAGGT 17100 

GGCACTTTTC GGGGAAATGT GCGCGGAACC CCTATTTGTT TATTTTTCTA 17150 

AATACATTCA AATATGTATC CGCTCATGAG ACAATAACCC TGATAAATGC 17200 

TTCAATAATA TTGAAAAAGG AAGAGTATGA GTATTCAACA TTTCCGTGTC 17250 

GCCCTTATTC CCTTTTTTGC GGCATTTTGC CTTCCTGTTT TTGCTCACCC 17300 

AGAAACGCTG GTGAAAGTAA AAGATGCTGA AGATCAGTTG GGTGCACGAG 17350 

TGGGTTACAT CGAACTGGAT CTCAACAGCG GTAAGATCCT TGAGAGTTTT 17400 

CGCCCCGAAG AACGTTTTCC AATGATGAGC ACTTTTAAAG TTCTGCTATG 17450 

TGGCGCGGTA TTATCCCGTA TTGACGCCGG GCAAGAGCAA CTCGGTCGCC 17500 

GCATACACTA TTCTCAGAAT GACTTGGTTG AGTACTCACC AGTCACAGAA 17550 

AAGCATCTTA CGGATGGCAT GACAGTAAGA GAATTATGCA GTGCTGCCAT 17600 

AACCATGAGT GATAACACTG CGGCCAACTT ACTTCTGACA ACGATCGGAG 17650 

GACCGAAGGA GCTAACCGCT TTTTTGCACA ACATGGGGGA TCATGTAACT 17700 
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FIGURE 12O

CGCCTTGATC GTTGGGAACC GGAGCTGAAT GAAGCCATAC CAAACGACGA 
GCGTGACACC ACGATGCCTG TAGCAATGGC AACAACGTTG CGCAAACTAT 
TAACTGGCGA ACTACTTACT CTAGCTTCCC GGCAACAATT AATAGACTGG 
ATGGAGGCGG ATAAAGTTGC AGGACCACTT CTGCGCTCGG CCCTTCCGGC 
TGGCTGGTTT ATTGCTGATA AATCTGGAGC CGGTGAGCGT GGGTCTCGCG 
GTATCATTGC AGCACTGGGG CCAGATGGTA AGCCCTCCCG TATCGTAGTT 
ATCTACACGA CGGGGAGTCA GGCAACTATG GATGAACGAA ATAGACAGAT 
CGCTGAGATA GGTGCCTCAC TGATTAAGCA TTGGTAACTG TCAGACCAAG 
TTTACTCATA TATACTTTAG ATTGATTTAA AACTTCATTT TTAATTTAAA 
AGCATCTAGG TGAAGATCCT TTTTGATAAT CTCATGACCA AAATCCCTTA 
ACGTGAGTTT TCGTTCCACT GAGCGTCAGA CCCCGTAGAA AAGATCAAAG 
GATCTTCTTG AGATCCTTTT TTTCTGCGCG TAATCTGCTG CTTGCAAACA 
AAAAAACCAC CGCTACCAGC GGTGGTTTGT TTGCCGGATC AAGAGCTACC 
AACTCTTTTT CCGAAGGTAA CTGGCTTCAG CAGAGCGCAG ATACCAAATA 
CTGTTCTTCT AGTGTAGCCG TAGTTAGGCC ACCACTTCAA GAACTCTGTA 
GCACCGCCTA CATACCTCGC TCTGCTAATC CTGTTACCAG TGGCTGCTGC 
CAGTGGCGAT AAGTCGTGTC TTACCGGGTT GGACTCAAGA CGATAGTTAC 
CGGATAAGGC GCAGCGGTCG GGCTGAACGG GGGGTTCGTG CACACAGCCC 
AGCTTGGAGC GAACGACCTA CACCGAACTG AGATACCTAC AGCGTGAGCT 
ATGAGAAAGC GCCACGCTTC CCGAAGGGAG AAAGGCGGAC AGGTATCCGG 
TAAGCGGCAG GGTCGGAACA GGAGAGCGCA CGAGGGAGCT TCCAGGGGGA 
AACGCCTGGT ATCTTTATAG TCCTGTCGGG TTTCGCCACC TCTGACTTGA 
GCGTCGATTT TTGTGATGCT CGTCAGGGGG GCGGAGCCTA TGGAAAAACG 
CCAGCAACGC GGCCTTTTTA CGGTTCCTGG CCTTTTGCTG GCCTTTTGCT 
CACATGTTCT TTCCTGCGTT ATCCCCTGAT TCTGTGGATA ACCGTATTAC 
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FIGURE 12P 

CCGCTCGCCG CAGCCGAACG 
GCGGAAGAGC GCCCAATACG 
TCATTAATGC AGCTGGCACG 
AGCGCAACGC AATTAATGTG 
TTACACTTTA TGCTTCCGGC 
ACAATTTCAC ACAGGAAACA 
CATGGGACGT CGACCTGAGG 
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